Combining dynamical decoupling with fault-tolerant quantum computation 
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We study how dynamical decoupling (DD) pulse sequences can improve the reliability of quantum 
computers. We prove upper bounds on the accuracy of DD-protected quantum gates and derive 
sufficient conditions for DD-protected gates to outperform unprotected gates. Under suitable con- 
ditions, fault-tolerant quantum circuits constructed from DD-protected gates can tolerate stronger 
noise, and have a lower overhead cost, than fault-tolerant circuits constructed from unprotected 
gates. Our accuracy estimates depend on the dynamics of the bath that couples to the quantum 
computer, and can be expressed either in terms of the operator norm of the bath's Hamiltonian or 
in terms of the power spectrum of bath correlations; we explain in particular how the performance 
of recursively generated concatenated pulse sequences can be analyzed from either viewpoint. Our 
results apply to Hamiltonian noise models with limited spatial correlations. 
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I. INTRODUCTION 

Two well-known methods for protecting quantum sys- 
tems from noise are dynamical decoupling (DD) and 
quantum error correction (QEC). In DD, pulses are ap- 
plied to the protected system, chosen so that the dam- 
aging effects of the noise nearly average away. In QEC, 
protected logical qubits are encoded as collective states 
of many physical qubits, chosen so that damage due to 
noise can be detected and reversed. 

Each method has advantages and disadvantages. On 
the plus side, resource requirements for DD are relatively 
modest |ll4l3|. Only unitary control operations need 
be applied to the system; there is no need to perform 
measurements or to replace used ancillary qubits with 
fresh qubits. Furthermore, a single physical qubit suf- 
fices for each protected logical qubit, and protected quan- 
tum gates can be implemented using relatively short se- 
quences of pulses. DD pulse sequences are simple enough 
that experiments on a wide variety of quantum systems 
have convincingly demonstrated the effectiveness of DD 
p^ - l25j . On the minus side, DD is effective only against 
low-frequency noise, slowly varying on the time scale set 
by the interval between pulses, and its effectiveness is 
intrinsically limited by imperfections in the timing and 
shape of the pulses. Furthermore, DD is not an efficient 
scheme for flushing entropy from the system, if no qubits 
are replaced or refreshed; thus it seems that DD docs 
not by itself provide a feasible route to scalable quantum 
computing. 

For QEC, on the plus side, the quantum accuracy 
threshold theorem establishes that QEC, through ju- 
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dicious design of fault-tolerant gadgets acting on code 
blocks, suffices for accurate simulation of arbitrarily long 
quantum computations, if the noise is sufficiently weak 
and reasonably local |26l - l33l |. QEC can succeed against 
high-frequency noise, where DD methods fail. On the mi- 
nus side, though, the resource requirements for QEC are 
quite daunting. A ready supply of fresh qubits is neces- 
sary; furthermore, the number of physical system qubits 
needed to encode one logical qubit, and the number of 
physical gates needed to execute one logical gate, can be 
substantial. Because of the complexity of fault-tolerant 
quantum computing protocols, and because these pro- 
tocols work only when the noise is already quite weak, 
experiments showing that QEC can suppress naturally 
occurring noise have not yet been performed. 

Because of their complementary strengths, DD and 
QEC used together should be more effective at protect- 
ing quantum computers from noise than either used by 
itself. Combining these two methods of error control is 
the topic of this paper. Hybrid schemes combining DD 
with QEC have been proposed previously [s^ - lsij . and 
even studied experimentally [37| . Our new contribution 
is a systematic investigation of the advantages of hybrid 
schemes for fault-tolerant quantum computing, including 
rigorous bounds on performance. 

Our main technical results are analytic expressions for 
the "effective noise strength" of quantum gates imple- 
mented using DD pulse sequences. The effective noise 
strength is (an upper bound on) the deviation in the op- 
erator norm of the noisy protected gate from an ideal 
gate. In the Hamiltonian noise models that we consider, 
the logarithm of the operator realized by a DD-protected 
gate can be expanded as a power scries (the Magnus 
expansion) in the noise Hamiltonian; we derive upper 
bounds on the sum of this series, obtaining formulas for 
the effective noise strength in terms of parameters in the 
noise Hamiltonian. We find such bounds both for general 
DD pulse sequences, and also for pulse sequences that 
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have an approximate time-reversal symmetry; in the lat- 
ter case the terms of even order in the Magnus expansion 
are heavily suppressed. 

Armed with our formulas for the effective noise 
strength, we derive a "noise-suppression threshold con- 
dition" on the noise parameters. When this condition is 
satisfied, DD-protccted gates are more accurate than un- 
protected gates. We also compare fault-tolerant quantum 
circuits composed from DD-protected gates with circuits 
composed from unprotected gates. In either case, we 
express the "accuracy threshold condition" on the noise 
parameters. When this condition is satisfied, quantum 
computation is scalable — accurate computations of ar- 
bitrary size can be performed with a reasonable overhead 
cost. Typically, improvements in gate accuracy achieved 
by DD mean that more noise can be tolerated by QEC 
combined with DD than by QEC alone, and that invok- 
ing DD can reduce the overhead cost of QEC. 

Our expressions based on the Magnus expansion for 
the effective noise strength depend on the operator norm 
of the Hamiltonian that governs the internal dynamics of 
the quantum computer's environment (the "bath"), and 
the results are not useful if this norm is large. But we 
also describe an alternative method of analysis yielding 
expressions for the effective noise strength in terms of 
the frequency spectrum of the bath correlations. Results 
derived by this method, based on the Dyson expansion, 
can be applicable even if the bath Hamiltonian has un- 
bounded norm, as long as the typical bath frequencies 
are sufficiently small. 

The performance of DD can sometimes be enhanced 
by using recursively generated "concatenated" pulse se- 
quences [l^. Adding an extra "level" to the recursive 
hierarchy further suppresses the effective noise Hamilto- 
nian, but at the cost of lengthening the pulse sequence, 
and the minimal effective noise strength is achieved by 
choosing the level that optimizes this tradeoff. We an- 
alyze concatenated DD sequences and estimate the op- 
timal effective noise strength, using both our bounds on 
the Magnus expansion and the correlation function view- 
point. 

Our analysis of the improvement in gate accuracy that 
can be achieved by combining DD and QEC applies only 
to a special class of Hamiltonian noise models. These 
models satisfy what we call the "local-bath assump- 
tion" which limits the spatial correlations in the noise. 
Whether our results can be extended to more general 
noise models that violate this assumption is an intrigu- 
ing open question. 

We formulate our noise model in SecHTl In Sec. lIIII and 
Sec. llVl we review and develop some of the tools we need 
to analyze the performance of DD pulse sequences and 
fault-tolerant quantum circuits using the Magnus expan- 
sion. We state our central results relating the effective 
noise strength of DD-protected gates to the properties of 
the noise Hamiltonian, and their implications concerning 
the noise-suppression threshold and accuracy threshold, 
in Sec. |Vl then wc apply these results to some specific 



pulse sequences in Sec. I VII Derivations of these results 
are contained in Sec. IVIII and the Appendices. We ana- 
lyze concatenated DD in Sec. IVIIII In Sec. IIXI we em- 
phasize that the effective noise can be related to intensive 
quantities that are independent of the spatial volume of 
the bath, and in Sec. |X] wc express the noise strength in 
terms of properties of bath correlations. Sec. IXII contains 
our conclusions. 



II. NOISE MODEL 

A. Noise Hamiltonian 

Wc denote by S the system consisting of all of the 
qubits in our quantum computer, and we describe the 
noise acting on S using a "noise Hamiltonian" iJ, which 
governs the joint evolution of the system and its envi- 
ronment, the bath B. During a computation, the Hamil- 
tonian also contains time-dependent terms that realize 
quantum gates acting on the qubits, but for now con- 
sider the case where there arc no gates; then H may be 
expressed as 

H = Hb+ i?err. (1) 

Here 

Hb^Is® Bo (2) 

describes the "free" evolution of the bath (how it would 
evolve if it were not coupled to the system) while 

Heir = Hg + HsB (3) 

includes all the terms in H that act non-trivially on the 
system. The term 

H°s = So® Ib (4) 

describes the unperturbed free evolution of the system; 
HsB contains terms coupling the system to the bath, and 
also perhaps other noise terms that act nontrivially only 
on the system. 

Though for some purposes it may seem natural to 
transform away Hg by working in the interaction pic- 
ture (that is, by considering the motion of the system 
relative to the rotating frame determined by Hg), we 
have included Hg in the term iJon- that represents the 
noise acting on the system. Our reason is that the DD 
sequences wc study arc designed to remove the effects 
of all "always-on" terms in the Hamiltonian that act on 
the system, i.e., not just Hsb but also the free evolution 
term Hg. We may by convention choose trs(i/orr) = 0, 
where trs denotes the system trace, since the trace of 
Hsb may be absorbed into the bath operator Bq, and 
the trace of 5*0 can be removed by subtracting a term 
proportional to Is (^Ib, which just shifts the zero point 
of the energy and has no dynamical effect. 
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Now consider modeling the noise during a nontrivial 
quantum computation. A computation is a circuit con- 
taining three types of operations: qubit state prepara- 
tions, unitary quantum gates, and qubit measurements. 
We model a noisy preparation as an ideal preparation 
followed by evolution according to H for a specified time 
interval, and we model noisy measurements as ideal mea- 
surements preceded by evolution according to H . We as- 
sume that quantum gates are executed using short, hard 
pulses, where, as in some experiments, the time interval 
between consecutive pulses is much longer than the pulse 
width. Each pulse has its support in a narrow interval of 
width 6, and we denote by rg the sum of the pulse width 
and the pulse interval (see Fig. [1]), where 5 ^ tq. To be 
concrete, we will sometimes assume that the pulses are 
perfectly "rectangular" — i.e., have vanishing rise-time 
and fall-time. However, the details of the pulse shape arc 
not really used in our analysis; rather, all that matters is 
that the pulse is confined to a narrow interval (and even 
this assumption will be relaxed in Sec. I VI C]) . We use the 
same noise Hamiltonian H to describe the noise both dur- 
ing a pulse and during the interval between pulses. We 
neglect errors in the timing and strength of the pulses; 
these are typically small in practice because the pulses 
are controlled by accurate classical circuitry. 



pulse width 







jjulse interval 

















FIG. 1: Parameters characterizing a sequence of uniformly 
spaced rectangular pulses: 5 is the pulse width, and tq — 5 is 
the interval between the end of one pulse and the beginning 
of the following pulse. 



noise Hamiltonian can be expressed as 

H = J2Ha, (5) 

a 

where the sum is over all locations occurring at a partic- 
ular time step, and where for any two distinct locations 
a and b in that time step. Ha and Hh act not only on 
disjoint sets of system qubits but also on distinct baths. 
That is, we may write 

Ha ~ Hs.a + Hcir^a, (6) 

with 

HB,a = Is,a (X" Bo^a, 

Hciyi.a — ^ ^ Sa^a ® B^.a-i (7) 
a 

where the operators Sa.a act on Qa, and where, for a ^ b, 
the bath operators Bo,a and Ba.a associated with loca- 
tion a commute with the bath operators Bqj, and Ba.t 
associated with location 6. Thus [Ha, Hi,] = for all 
location pairs a and b. Each Ha is assumed to be time- 
independent during the duration of location a (this as- 
sumption is helpful because DD pulse sequences are typ- 
ically designed to cope with a time- independent noise 
Hamiltonian), but Hamiltonians at different locations 
need not be the same. 



c 
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B. Local- bath assumption 

To further simplify our analysis, we make an additional 
assumption about the noise, which we call the local-bath 
assumption [s^l, illustrated in Fig. [2] Let us use the term 
"location" to speak of an operation in a quantum circuit 
that is performed in a single time step — a location may 
be a single-qubit or multi-qubit gate, a qubit preparation 
step, a qubit measurement, or the identity operation in 
the case of a qubit that is idle during the time step. Each 
time step has duration tg; thus to = Ntq if N equally 
spaced pulses are applied at a particular location. For a 
specified location labeled by a, let Qa denote the set of 
qubits that participate in the operation applied at that 
location (for example, a pair of qubits if the operation is 
a two-qubit gate). Under the local-bath assumption, the 



FIG. 2: (color online) Illustration of the local-bath assump- 
tion. Solid (blue) lines are system qubits, and dashed (black) 
lines are bath subsystems. Each unfilled rectangle is a quan- 
tum gate, and its associated filled (light grey) rectangle repre- 
sents the joint evolution of system qubits and bath subsystems 
that are coupled by the noise Hamiltonian. The filled rect- 
angles do not overlap, indicating that when two gates act in 
parallel on distinct sets of system qubits, the associated bath 
subsystems are also distinct. 

The local-bath assumption allows us express the time 
evolution operator for a single time step as a product 
of unitary operators, each associated with one particu- 
lar location, and to analyze the effectiveness of the DD 
pulse sequence for each location separately. Without this 
assumption, a rigorous analysis of DD-improved fault- 
tolerant circuits would be far less tractable. We expect 
our local-bath model to be a reasonable approximation 
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to the noise in actual systems, if qubits are well isolated 
from one another when they are not coupled by quan- 
tum gates. However interactions between qubits (and 
their associated baths) at different circuit locations are 
surely present at some level, and in Sec. IIXI we will com- 
ment further on how our analysis is affected when the 
local-bath assumption is relaxed. 



C. Noise parameters 

To characterize the noise strength, it is useful to intro- 
duce the parameters /3, J, e defined by: 



/3 = maxjli^B ail, 

a 

J = max||i7crr,a||, 
a 

e = l3 + J> max||iJa||. 

a 

The norm here is the sup operator norm 



(8) 
(9) 
(10) 



(11) 



where the vector norm is the Euclidean norm — 
y/ {v\v). Actually, our results concerning the effective- 
ness of DD pulse sequences apply for any norm that is 
unitarily-invariant (and therefore also submultiplicativc 
[ssj). but the operator norm will be used to relate these 
results to the accuracy threshold for fault-tolerant quan- 
tum computing [sO, [SH . We are typically interested in 
the case where the noise is weak, in the sense that the di- 
mensionless parameter erg is small compared to one (and 
hence also Jtq ^ 1 and /3to ^1). We will derive bounds 
on the performance of DD-protcctcd quantum gates ex- 
pressed in terms of these small quantities, and also in 
terms of the dimensionless pulse width S/tq <^ 1 . 

For our analysis of fault-tolerant circuits, we will find 
it convenient to assume that measurements and prepara- 
tions are at least as fast as pulses, i.e., can be executed 
in time at most S. But in Sec. IV Dl we will discuss how 
to interpret our results if measurements or preparations 
take much longer than pulses. 



A. Toggling frame 

For now, disregard that we want to do computation, 
and focus instead on quantum storage — the original 
context for DD methods. In the absence of any ex- 
ternal control, the system and bath evolve under the 
time-independent noise Hamiltonian H. A DD pulse se- 
quence is realized via a time-dependent control Hamilto- 
nian Hc{t) acting only on the system so that the system 
and bath evolve according to H + Hc{t). (In our noise 
model, wc assume that the same noise Hamiltonian H 
applies during a pulse as between pulses, while recogniz- 
ing that this assumption is really an idealization.) The 
DD sequence can be described using cither Hc{t) itself or 
using the time evolution operator Uc{t) = Uc{t, 0) gener- 
ated by Hc{t). 

For understanding the effects of the control Hamil- 
tonian, it is convenient to use the interaction picture 
defined by HJt), also known as the toggling frame 
[ESS EIj HJI- The toggling- frame density operator 
Pssit) is related to the Schrodingcr-picture density op- 
erator psB{t) by 



PSB{t) = U{t,0)psB{0)U^{t,0) 
= U,{t)~psB{t)Ul{t), 



(12) 



where U{t, 0) is the evolution operator generated by the 
full Hamiltonian H+Hc{t). Therefore the toggling-frame 
state evolves according to 

PsBit)^U{t,0)psBiO)UHt,0), (13) 

where the toggling-frame time evolution operator 

Uit,0) = Ulit)U{t,0) (14) 

is generated by the toggling-frame Hamiltonian 

H{t) = U}{t)HUc{t). (15) 

Since Uc{t) acts nontrivially only on the system, H{t) 
can be written as 



H{t)=HB+H,„{t), 



(16) 



III. TOOLS 

Let us next review some tools for analyzing the noise 
suppression arising from DD techniques. We focus here 
on the foundations of our analysis based on the Magnus 
expansion; further background, needed for our analysis 
based on bath correlation functions, will be discussed 
in Sec. [X] We also provide here a brief discussion of 
fault tolerance, including the notion of the effective noise 
strength at a circuit location, a central quantity in our 
analysis. 



where Hc„{t) = Ul{t)Hc„Uc{t) is the togghng-frame ver- 
sion of Hcvr- Because the operator norm is unitarily- 
invariant, we have ||iJ(<)|| = \\H\\ < e and ||i7crr(^)!l = 

IliJerrll < J- 

We consider cyclic DD, where Uc{t) returns to the iden- 
tity (up to a possible irrelevant overall phase) at the end 
of a cycle taking time ^dd: 

UcitBo) = UciO) = I. (17) 

Therefore, at the end of the cycle, the toggling-frame and 
Schrodinger-picture states coincide. 
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B. Finite-width pulses 

In DD, the system is controlled using a sequence of 
pulses, where the control Hamiltonian Hc(t) vanishes in 
between the pulses. The control unitary resulting from a 
sequence of R pulses can be expressed as 

Uc = IPrIPr-iI...P21PiI. (18) 

where Pk is the unitary achieved by the kth pulse. We 
have inserted the identity I between successive pulses to 
indicate the time intervals during which Hc{t) = 0. For 
some pulse sequences, including the ones described in 
Sec. I VII all pulse intervals have the same duration, but 
for most of our analysis (excluding some of the discussion 
of pulse- width effects in Sec. I VII A ip we need not assume 
that the pulses are uniformly spaced. (It is known that 
the effectiveness of DD can sometimes be improved by 
varying the spacing between pulses l2ll [39l - l42| . ) 

If the pulses are rectangular with width S, then we may 
write 

Pk = eM-^SHpJ, (19) 

where iJp^. is the time-independent control Hamiltonian 
that is turned on during the fcth pulse. If the fcth pulse 
begins at time Sfe, then the control unitary during the 
pulse {t G [sfe, Sfc + S)) is 

Ucit) = cxp (-lAkHp,) Ucisk) 

= exp {-lAkHp^ ) Pfc-i . . . P2P1 , (20) 

where = sj.. The toggling-frame Hamiltonian H{t) 
can be written as 

H{t)^UUt)HU,{t) (21) 

_jH^''-^'> fort e [sfe_i+J,Sfc), 

[ e ^ ^fc H^" ^>e for < e [sfc, Sfc + (5), 

where 

^ pIp^^ pl_^HPk-i . . . P2P1. (22) 

In the case of cyclic DD, after the last pulse of a complete 
cycle we have C/c = I and H — H. 

C. Magnus expansion 

For a given H{t), the toggling- frame time evolution op- 
erator U{tDD, 0) can be computed using a Magnus expan- 
sion (43| . For a unitary time evolution operator UM{t, 0) 
satisfying the Schrodinger equation 

^^C/A/(^,0) =iJM(i)t^M(t,0), f/A/(0,0)=I, (23) 

determined by Hamiltonian HM{t), the Magnus expan- 
sion at time T is an operator series 

00 

n{T) = J2^n{T) (24) 

n=l 



such that 

[/A/(T,0) = exp[17(r)], (25) 

and rin is nth order in the Hamiltonian H]\j{t). Thus, 
for the fixed time T, time evolution generated by the 
time-dependent Hamiltonian Hm (t) is equivalent to time 
evolution generated by the time-independent effective 
Hamiltonian iJoff = ^i^{T). 

The leading terms in the Magnus expansion are (see 
for example, [4J|) 

ni{T) = -i[ dsHMis), (26) 
Jo 

n2{T)^-]-( ds, [ ' ds2[HM{si),HM{s2)], (27) 
^ Jo Jo 



n3(T) = 




{[HM{si),[HAlis2),HMis3)]] 
+ [HMis3),[HMis2),HMisi)]]). 

Higher-order terms can be computed using a recursive 
formula; sec Scc. lVIII and Appcndix[3 In general, i7„(T) 
is the time integral of a sum of (n — l)-nested commu- 
tators, each with n factors of HM{t). The Magnus ex- 
pansion is thus an infinite series in H]\jT; a sufficient 
condition for convergence is [i^ 

/ dt ||Fm(0|| < 7:. (29) 
Jo 

For cyclic DD, we can use the Magnus expansion 
to compute the toggling-frame time evolution operator 
U{tuu,0) for one complete cycle, where HM{t) is the 
toggling-frame Hamiltonian H{t) ~ Ul{t)HUc{t) and 
UcitDi)) = I. In first order we obtain 

f^i(iDD) = / dt Hit) 
Jo 

= -i I dt Ul{t)HUc[t). (30) 
Jo 

For group-based DD schemes, like the examples we will 
discuss in Sec. lVIl the integral Eq. (PH]) averages H over a 
finite group Q if the pulses are ideal, projecting H into the 
commutant oi Q Q . If ^ acts irreducibly on the system 
Hilbert space, the commutant contains only the identity 
operator acting on the system, and therefore rii(iDD) 
acts nontrivially only on the bath. In that case we say 
that rii(iDD) is a "pure bath" term. 

We say that a DD pulse sequence achieves first-order 
decoupling if the first-order term in the Magnus expan- 
sion for Uc{t-o-D) acts trivially on the system. More 
generally, the sequence achieves mth-order decoupling if 
f^n(iDD) is a pure bath term for each n < m. In our 
analysis we will at first consider pulse sequences that 
achieve first-order decoupling for ideal zero-width pulses 
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(later we will discuss the corrections to first-order de- 
coupling that arise when the pulses have nonzero width, 
and we will also describe "Eulerian" pulse sequences that 
achieve first-order decoupling even when pulse widths are 
nonzero Q)- In particular, these pulse sequences have 
the property 

/ °° dt #orr,o(i) = 0, (31) 
Jo 

where the subscript "0" on Hcn-.oit) indicates that the 
toggling- frame Hamiltonian Hcri{t) is considered in the 
limit S 0, while holding to fixed. (For Eq. (|3T|) to apply 
there must be no term in iJcn-.o that acts nontrivially on 
the system and commutes with Hc{t) for all t £ [Ojioo]; 
if such terms were present they would not be removed 
by the DD sequence described by Hc{t).) For pulse se- 
quences satisfying Eq. ([31]) it follows from Eq. that 
the first-order term in the Magnus expansion is 

^litDo) = -I / dt Hoit) 
Jo 

= -iHBtBB - i I dt Hcir.o{t) 

Jo 

= -iHstuB, (32) 

a pure bath term, when 6 — 0. For pulses with nonzero 
width (5, first-order decoupling is not exact, but the de- 
viation of f2i(tDD) from a pure bath term is 0{S/to) and 
thus small when the pulses are sufficiently narrow. For 
suitably designed pulse sequences the deviation can be 
improved to a higher power of S/tq 0, SI]- 

A pulse sequence that achieves first-order decoupling 
will also achieve second-order decoupling if H is time- 
symmetric: H{tui) - t) = H{t) for t € [0,tDD]- This 
condition is satisfied provided 

Ucituu -t) = VtUcit), (33) 

where Vt is unitary and commutes with H (for example, if 
Vt = e"^'I is a phase). In fact, when H is time-symmetric, 
not just the second-order term, but all even terms in the 
Magnus expansion vanish (47j . as we show in Appendix 

m 



D. Quantum accuracy threshold theorem 

The quantum accuracy threshold theorem establishes 
that a noisy quantum computer can operate reliably if 
the noise is sufficiently weak. Under the local-bath as- 
sumption formulated in Sec. |lll the operation applied at 
location a in the noisy circuit is a unitary transforma- 
tion Ga acting on the system and bath, which can be 
expressed as 

Ga^Ga+Ba. (34) 



Here Ga is the "good part" of the operation; it can be 
expressed as Ga® where Ga is the ideal operation 
that would be applied to the system in the absence of 
noise, and Ba is a unitary transformation acting on the 
bath. The operator Ba is the "bad part," the deviation 
of Ga from the ideal operation, which acts jointly on 
system and bath. (Recall that wc model a noisy qubit 
preparation as an ideal preparation followed by a noisy 
unitary transformation, and a noisy qubit measurement 
as a noisy unitary transformation followed by an ideal 
measurement; for preparation or measurement locations, 
Ga denotes the noisy transformation that follows or pre- 
cedes the ideal preparation or measurement.) In this 
noise model, we may characterize the noise strength by 

= max||Sa|l, (35) 

a 

the maximum value of the operator norm of the bad part, 
where the maximum is with respect to all locations in the 
noisy circuit. The threshold theorem asserts that an ideal 
circuit of arbitrary size can be simulated accurately if 77 
is less than a critical value 770, the accuracy threshold. 
The threshold theorem proved in [sij actually applies to 
a broader class of noise models that do not necessarily 
satisfy the local-bath assumption, but this class includes 
the noise model of Sec.|ll]as a special case. The analysis 
in [4^ established a lower bound on the accuracy thresh- 
old, ?7o > lO"**. If 77 < ?7o, then we say the noise is below 
the accuracy threshold, meaning that scalable quantum 
computing is possible. 

In this paper we will relate the noise strength 77 defined 
by Eq. and Eq. ([55]) to the parameters that charac- 
terize the noise model defined in Sec. |lTl We denote by 
7;dd the value of 77 that can be achieved using dynamical 
decoupling, and we denote by 77 the value of 77 achieved 
without using dynamical decoupling. If 77DD < ?7, then we 
say that the noise is below the noise suppression thresh- 
old^ meaning that dynamical decoupling reduces the ef- 
fective noise strength. 

In Sec. |V] we express 7700 h^ terms of the parameters 
J and e defined in Eq. ([Sjl- pH]) . In Sec. |X] we express 
77DD in terms of properties of bath correlation functions, 
using a different formula than Eq. based on the 

Dyson expansion. 

IV. DD-PROTECTED GATES 

A. Including the gate pulse 

So far wc have described how to reduce the noise in 
a quantum memory using cyclic DD. Now wc want to 
estimate the effective noise strength achieved by DD for 
operations other than the identity, so we must explain 
how DD is used to suppress the noise in these nontrivial 
operations. We will describe nontrivial quantum gates, 
postponing discussion of preparations and measurements 
until later. 
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We refer to one cycle of the DD pulse sequence for the 
identity gate as the "memory" sequence. To perform a 
DD-protected nontrivial gate Ga, wc must modify the 
memory sequence accordingly. In fact our DD pulse se- 
quence for the gate is exactly the same as the memory 
sequence, except for the very last pulse. If the memory se- 
quence of R pulses ends with a period of trivial evolution, 
then we append a pulse implementing Ga to the end the 
memory sequence. Thus, if the memory sequence lasts 
time toD and the pulse width is S, then the Ga pulse se- 
quence lasts time to = tuD + S and uses = pulses. 
If on the other hand the i?-pulse memory sequence ends 
with a nontrivial pulse implementing P, then we combine 
this pulse and the gate pulse into a single pulse imple- 
menting GaP- Again, we denote the total time for the 
Ga pulse sequence by to, and the total number of pulses 
by R). 

While wc assume for simplicity that every pulse has 
the same width S, we recognize that in some cases differ- 
ent types of pulses may have different time scales. For 
example, in recent experiments with quantum dot qubits, 
X gates arc implemented using (fast) exchange couplings 
and Z gates are implemented using (slow) magnetic field 
gradients • One may interpret S as the duration of the 
longest pulse used, or one could easily refine our analysis 
by allowing different pulses to have different widths. 

In a DD-protected circuit, each Ga gate is replaced by 
the corresponding DD-protected gate; under the local- 
bath assumption, the noisy DD-protected gate is a uni- 
tary transformation denoted Ga acting jointly on the sys- 
tem qubits involved in the gate and the associated bath 
subsystem. Though the duration to of a DD-protected 
gate is longer than the duration To of an unprotected 
gate, the DD-protected gate may be more accurate than 
the unprotected gate, if the noise is weak enough. 

At the end of the complete Ga pulse sequence, the uni- 
tary operator J7c,a(to) generated by the control Hamilto- 
nian Hc{t) (which now includes the gate pulse) is 



C^c,a(to) — GaUc{tBD) — Ga, 



(36) 



because the cyclic memory sequence satisfies C/c(tDD) = I 
(up to a possible phase). Therefore the noisy DD- 
protected gate at location a is 

Ga = Ua{to,0) - UcAtn)Ua{to,0) = GaUa{to,0), (37) 

where C/a(to,0) is the toggling- frame time evolution op- 
erator. The corresponding toggling-frame Hamiltonian is 
similar to the toggling-frame Hamiltonian Eq. (|2ip for the 
memory sequence, except for the appended gate pulse: 



Ha{t) - Ula{t)HaUUt) 



(38) 



'-fc fortg [sfc,SfeH- (5), 

[ G\HaGa for t = to. 



Eq. ((38)) applies to the case where the gate pulse is ap- 
pended to the end of the memory sequence; the mem- 
ory sequence contains R equally spaced pulses labeled by 
fc = 1, 2, . . . , i?, and the gate pulse begins at time spt+i- 
The DD-protected qubit measurement is the memory 
pulse sequence followed by an ideal measurement. We 
assume that the measurement takes time 5, the same as 
the pulse width, so that the duration to of the protected 
measurement matches the duration of the DD-protected 
gate. Similarly, the DD-protected qubit preparation is 
an ideal preparation lasting time 5 followed by the mem- 
ory pulse sequence. See Sec. IV Dl for discussion of how 
our analysis is modified when preparations and measure- 
ments are slow compared to other operations. 



B. Effective noise strength 

To define the effective noise strength for the DD- 
protected gate, we divide the noisy gate into a good part 
and a bad part as in Eq. p4p . obtaining 

Ga ^GaUB,a{t^) + Ga_GaUBM^ (39) 

The good part Qa describes the ideal evolution in the 
absence of noise (iJon- = 0) — the ideal gate Ga is ap- 
plied to the system, while the bath evolves according to 
its unperturbed Hamiltonian Hs^a- The bad part Ba de- 
scribes the effects of noise, as modified by the DD pulse 
sequence. 

As we discuss in more detail in Sec. I VII B| we may 
choose a different way of separating the pure bath dy- 
namics into a good and bad part than the choice made 
in Eq. p9p . Incorporating UB,aito) into Qa is convenient 
when we use the Magnus expansion to analyze the perfor- 
mance of DD-protected gates, but another choice is more 
convenient for the analysis based on the Dyson expansion 
in Sec lVHB] 

Using Eq. p7p and the unitary invariance of the oper- 
ator norm, we obtain an expression for the noise strength 
of the DD-protected circuit: 

?7DD = max||Sa|| = max Ua{to,0) - UbA^o) ; (40) 

a a 

this is just the norm of the bad part expressed in the tog- 
gling frame. In what follows, we will sometimes drop the 
subscript a when context makes the intended meaning 
clear. 

We can now estimate t^dd using the Magnus expansion. 
We write 

i7(to, 0) = cxp[l](to)] = exp[-itoH,ff], (41) 

where Hcs = ^fi(to), and 17 (tg) can be computed using 
the (gate- appended) toggling- frame Hamiltonian H{t) in 
Eq. dSHl). To bound the quantity \\U{to,0) - C/s(to)||, we 
make use of Lemma [3] in Appendix [Cl which gives: 



|[/(to,0)-C/i3(to)|| < to||i/cff-i?B| 



(42) 
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Inserting the Magnus expansion —itoHcS = ^{to) = 
Y.n=i ^n{to) we find 

f?DD < iomax||i7off,a - -ffs.all 

a 

oo 

= max II Qn.ai^o) + ito^sA 

a ^ — ^ 

oo 

< max {\\n[^,{to)\\ + \\nnAto)\\), (43) 

where fi'j^ ^(t) = + itHs.a- For a pulse sequence 

that achieves first-order decoupling with ideal zero- width 
pulses, Vl'i a(io) vanishes in the limit (5 — > 0. To derive a 
useful upper bound on the effective noise strength jyoD, 
we will need good bounds on the other terms in Eq. (|43| . 



C. The effective noise strength for a 
time-symmetric sequence 

We say that the memory pulse sequence is time- 
symmetric (or "palindromic") if ff(tDD ~ t) = H{t) for 
t e [0,iDD]- We will show in Appendix IB] that a time- 
symmetric pulse sequence that achieves first-order decou- 
pling also achieves second-order decoupling. However, 
the time symmetry is broken if we construct the DD- 
protected gate by appending the gate pulse to the mem- 
ory sequence, even if the memory sequence by itself is 
time-symmetric. 

For the purpose of estimating the effective noise 
strength, we can nearly restore the time symmetry of 
the DD-protected gate by a simple trick (see Fig. [3|. 
Recalling that our goal is to derive an upper bound on 
||C/(io,0) — UB{to)\\, we observe that the unitary invari- 
ance of the operator norm implies 



\\U{h,0)-UB{ta)\\ 



\U{t^,Q)ul{5)- 



-UB{h-5)\l (44) 



where 5 is the width of the gate pulse, and = ^dd + ^■ 
Furthermore, we may regard tj{to,Q)U\{6) as the time 
evolution operator generated by the Hamiltonian 



HM{t) 



-He te[0,(5), 
H{t-5) te [6,T], 



(45) 



where T ^ Iq + 6. If the memory sequence is time- 
symmetric, then Hm{T - t) = HM{t) for t e [5,T - 6]. 
Thus H]\i{t) is "nearly time-symmetric" in the interval 
[0, T]; the symmetry is broken only in the small intervals 
[0, 6] and [T - 5, T] at the beginning and end of [0, T]. 

The unitary operator U{to,0)U^{S) = exp[ri(T)] can 
be computed using the Magnus expansion for Hamilto- 
nian HM{t). Viewing U{to,Q)Ug{S) as being generated 
by the time-independent Hamiltonian iQ{T)/(tQ — 6) for 
time to — 5, and again using Lemma |3] in Appendix [Cl we 



obtain instead of Eq. (|43p . 

??DD = max \\IJa{to, 0)1/1^5) - UbA^o - 
< max\\naiT) + liT - 2S)HB.a\\ 



m^x(\\n[jT)\\ + J2\\^nAT)\\ 



71=2 



(46) 



where n[ ^{T) is now defined as n[ ^{T) = Qi aiT) + 

i{T-2S)HB,a- 

More generally, we say that the Hamiltonian HM{t) 
is nearly time-symmetric in [0, T] if the time symmetry 
holds everywhere except in a small interval or the disjoint 
union of several small intervals. We denote by A the 
region in which the time symmetry is violated; thus 



HM{T-t) = HM{t) for i ^ A 
HM{T^t)^HM{t) fort e A. 



(47) 



We also use the same symbol A(<C T) to denote the total 
length of this region. Thus A = for a perfectly time- 
symmetric sequence. In what follows, we will sometimes 
say that the pulse sequence realizing a DD-protected 
gate is "time-symmetric" if the corresponding memory 
sequence is time-symmetric, even though the time sym- 
metry may be broken by the gate pulse appended to the 
memory sequence. We say that the memory sequence 
and the DD-protected gates are "general" if the memory 
sequence has no special time symmetry properties. 



V. EFFECTIVE NOISE STRENGTH AND 
THRESHOLD CONDITIONS 

In this section, we state some of our conclusions con- 
cerning the effective noise strength t^dd achieved by 
dynamical decoupling, and the implications for fault- 
tolerant quantum computing. Derivations will be post- 
poned until Sec. lVIII Here we focus on results derived us- 
ing the Magnus expansion; results relating ?7dd to proper- 
ties of bath correlation functions derived using the Dyson 
expansion are discussed in Sec. [X] 



A. Bounds on the Magnus expansion 



Combining Eq. (|43| and Eq. (|46|). we can state our 
upper bound on the effective noise strength ?7dd as 



vDD<\mT)\\+j2mn{T)\\, 



(48) 



where n[{T) = ni{T) + i{T ~2T)Hb, T = to + T, and the 
maximum over all locations is understood. The Magnus 
expansion f2(r) = X]ri=i ^n{T) is computed using the 
Hamiltonian 



HM{t) = 



-Hb ie[o,r), 

H{t -T)=Hb + Hcrrit -r) te [T, T]. 



(49) 
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FIG. 3: Schematic representation of HM[t) for time-symmetric DD pulse sequences. The time axis is bent in half, with time 
flowing counterclockwise from the upper right to the lower left, so that times aligned on the upper and lower branches are related 
by time symmetry, (a) Two pulses (marked as black boxes) in a time-symmetric memory sequence with H{tou — t) = H{t); 
the pulse on the bottom branch is the time-reversed version of the pulse on the top branch, (b) Appending the gate pulse (box 
G) to the memory sequence spoils the time symmetry; the black pulses on the upper and lower branches are no longer aligned, 
(c) Appending fictitious time evolution governed by —Hb during t G [0, 5] (box B) restores the time symmetry of the memory 
sequence for t £ [S,T — 5], where T = to + 5. 



For the general case, in which we are not trying to exploit 
the time symmetry of the memory sequence, we choose 
r = 0. For the nearly-timc-symmctric case we choose 
r = (5, and Hm is time-symmetric in the interval [S, T—S] . 

If the memory sequence achieves first-order decoupling 
in the limit (5 — > 0, then ||f2']^(T)|l vanishes apart from 
finite-width corrections. The nth-order Magnus term 
riniT) for n > 2 satisfies ||f2„(r)|| = O ((eT)"), because 
||-ffM(OII — ^1 ^-iid the integral fi„(T) can be bounded 
by the volume of the integration region times an up- 
per bound on the integrand. In fact, this estimate can 
be improved to ||f2„(T)|| = O (( jr)(eT)"-i) , because 
HM{t) has the form ±Hb + H'{t) where H'{t) is either 
or Hcii{t); therefore \\[HM{ti), HM{t2)]\\ = 0{Je), since 
£ J and Hb commutes with itself. 

We anticipate, then, that at any location a, the terms 
in the Magnus expansion can be bounded as 

\\n[{T)\\ < C,{JT)- (50a) 

||a^(r)|| <C7„(JT)(er)"-i, n = 2,3,4; (50b) 

oo 

Y.\\n,,{T)\\<C5{JT){eT)\ (50c) 

n — 5 

where Ci, C2, C3, C4, C5 are constants. Note that the last 
of these results bounds the sum of all high-order Magnus 
terms with n > 5. Combining Eq. and Eq. ([50)) we 
find 

5 

?7DD < (JT)^C7„(eT)"-i. (51) 

n=l 

The constants C„, derived in Sec. lVIIl are listed in Table 
H] for both general and time-symmetric pulse sequences. 
Our value of C5, obtained by bounding an infinite series, 
holds only for eT < 0.54, a condition likely to be satisfied 
when DD works effectively. If desired, tighter bounds 
can be derived on the nth order terms with n > 5 using 





General 


Nearly time symmetric 


Dyson (General) 


Ci 


2NS/T in general, 

N5/T if pulses are regularly spaced in time 


C2 


1/2 




1 


C3 


2/9 


2/9 


1/2 




11/9 


56(#)(l-i#) 


1/6 


C5 


9.43 


9.43 


.0466 



TABLE I: Constants {Cn} appearing in the upper bound 
Eq. (|51[l on the effective noise strength r?DD, derived from 
the Magnus expansion in the general case and the nearly- 
time-symmetric case, and from the Dyson expansion in the 
general case. denotes the total number of pulses in the 
DD-protected gate, 5 is the pulse width, T = to in the gen- 
eral case, and T = to + 5 m the nearly-time-symmetric case. 
For the nearly-time-symmetric case, A is the size of the small 
region in which the time symmetry is violated. The value of 
C5 applies assuming tT < 0.54. 



results from Sec. IVIIl However, we judge Eq. (|50cp to 
be good enough for our purposes, since this bound on 
the sum of higher-order terms is already rather small for 
eT ^ 1, as in typical cases of interest. Also listed in 
the last column of Table U are values of {C,i} derived 
in Sec. IVIIl using the Dyson expansion rather than the 
Magnus expansion, also under the assumption eT < 0.54. 
These upper bounds are weaker for n = 2, 3 but stronger 
for n = 4, 5, and hence provide a tighter estimate of the 
effective noise strength for pulse sequences that achieve 
third-order decoupling. 

Our bounds on 0„(T) for n odd is not improved by 
invoking time symmetry, but for n = 2,4, the bounds 
Hsted in Table U are tighter for the time-symmetric case 
than the general case, assuming A/T ^ 1. In fact, C2 
and C4 vanish in the limit A/T — > 0, reflecting the prop- 
erty that all even-order terms in the Magnus expansion 
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vanish when the time symmetry is exact. For the time- 
symmetric case, we derive bounds on C„ for even n > 6 
in Appendix |d1 but these results were not used in our 
estimate of C5 . 

B. Noise suppression threshold 

DD-protected gates will outperform unprotected gates 
if the noise is weak enough. In a circuit of unprotected 
gates, each gate is realized by a single pulse, where the 
pulses are separated in time by the pulse interval tq. For 
the noise model of Sec. HIl the effective noise strength for 
this computation may be expressed as [sO, HH 

r] = (^max \\HsB.a\{) ^o- (52) 

Eq. ([52]) is not derived using the Magnus expansion; 
rather it follows directly from Lemma [3] in Appendix [C] 
The noise strength 77 does not depend on the pulse shape; 
all that matters is the strength of the noise Hamiltonian 
HsB,a and the time tq allotted for executing the gate. 
If we assume that H° = 0, Eq. ^ becomes 

r/ = Jro. (53) 

We say that the noise model satisfies the noise suppres- 
sion threshold condition if the effective noise strength can 
be reduced by using DD-protcctcd gates instead, i.e., if 

?7dd < ?7- (54) 

In our noise model, this condition can be expressed in 
terms of the parameters ero, S/tq and ro/to. 

For example, continuing to assume that Hg ~ 0, 
suppose in addition that (J/tq is negligible and eT is 
small enough so that the Magnus expansion is well- 
approximated by the lowest-order nonzero term. Then, 
in the general (non-time-symmetric) case, using Ci = 
and C2 = 1/2, we can approximate t^dd by 

we use the ~ symbol to emphasize that higher order cor- 
rections in (5/ro and tt^ are neglected. The noise sup- 
pression threshold condition 77DD < '7 = Jtq is satisfied 
for 

.r.<2(^)', ,56) 

or 

ero < 2iV-2 (57) 

for a sequence of equally spaced pulses. As the pulse 
sequence grows, the duration to of DD-protected gates 
increases relative to the duration tq of unprotected gates, 
and Eq. (|56|) imposes a stronger restriction on e. 



Note that ryoD depends on the norm of the bath Hamil- 
tonian /3 (which contributes to e), while r\ does not. Tech- 
nically, this difference comes about because the second 
order Magnus term exhibited in Eq. (j55p contains a con- 
tribution from the non-vanishing commutator between 
Use and Hb , while 77 is computed directly as a difference 
between the ideal and noisy Hamiltonians, differing only 
by (see Appendix [Ct . Physically, 77DD depends on 
/3 because dynamical decoupling works effectively only if 
the bath dynamics is sufficiently slow. Alternatively, we 
could estimate 77DD in terms of parameters other than 
/3 that characterize the speed of the bath dynamics; for 
example, we will derive in Sec. |X] an expression for 77DD 
involving the bath's frequency spectrum rather than the 
operator norm /3. 

In the limit of zero-width pulses, a time-symmetric 
pulse sequence that achieves first-order decoupling 
achieves second-order decoupling as well, so that C\ = 
C2 = 0. Imposing time symmetry may lengthen the 
pulse sequence; we denote the duration of a DD-protected 
time-symmetric gate by t'^, to contrast with the dura- 
tion io of the gate when the pulse sequence is not time- 
symmetric. In the time-symmetric case, the effective 
noise strength becomes (assuming 5 = and thus T = to, 
and using C3 = 2/9) 

Therefore the noise suppression threshold condition 
rjBB < Jto is satisfied if 

or 

ero<^{Nr'^' (60) 

for the case of N' equally spaced pulses. Even though 
to > to, Eq. ((59)) places a less stringent condition on e 
than Eq. provided tyto < (9to/2ro)i/3. We empha- 
size again that Eq. ([55| and Eq. ([5^ are derived using 
lowest-order approximations in an expansion in S/tq and 
e. 

The expression Eq. (jSSp for t^dd indicates that to 
achieve effective noise suppression we should favor short 
DD pulse sequences (with to /to not too large) to mini- 
mize the exposure to noise during the DD-protected gate. 
On the other hand Eq. (|58|) illustrates that a longer pulse 
sequence can pay off if it allows us to achieve higher- 
order decoupling. These results exemplify a more general 
tradeoff between shorter sequences and better decoupling 
that must be optimized to design DD-protected gates 
with the smallest possible effective noise strength. The 
tradeoff is also manifested by the analysis in Sec. IVIlfl of 
concatenated DD pulse sequences. 
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C. Accuracy threshold and overhead cost 



(b) 



threshold 
1 1— 

threshold 
H 1 



noise strength 



noise strength 



Furthermore our results are useful only if the Hamilto- 
nian of the local bath has finite norm (so that e < oo). 
However, we will see that the correlation function anal- 
ysis in Sec. |X] can provide useful upper bounds on 7700 
even if e is infinite. 



D. Slow preparations and measurements 



FIG. 4: Two scenarios where DD-protected gates outper- 
form unprotected gates, (a) Quantum computing is scalable 
with DD-protected gates, but not with unprotected gates, 
(b) Quantum computing is scalable with either DD-protected 
gates or with unprotected gates, but DD reduces the overhead 
cost of fault tolerance. 

A quantum computation unprotected by DD is scalable 
if the noise strength of unprotected gates is below the 
accuracy threshold, r\ < rjQ. For DD-protected gates, 
the accuracy threshold condition becomes 77DD < Vo- If 
the noise suppression threshold condition is satisfied, so 
that r^DD < Vi it may be that »7 > 770 and ?7dd < 1^o^ 
in that case, arbitrarily large quantum circuits can be 
simulated accurately with DD-protected gates, but not 
with unprotected gates. This is illustrated in Fig. Slja). 

Even when rj < rjQ, DD may reduce the overhead cost of 
fault-tolerant quantum computing if ?7dd < V [Fig-SJb)]. 
Suppose that we wish to simulate an ideal quantum cir- 
cuit containing L gates. If our noisy gates have noise 
strength 77, which is below the threshold value rip, the 
simulation is possible using L* noisy gates where [3l| 



L* ^ / log(c7?o£/0) \\ 
L \ log(?7o/?7) J 



(61) 



here c and a are constants, and 9 is the "error" in the 
simulation (the distance between the ideal probabil- 
ity distribution of outcomes and the simulated distri- 
bution). Denote by L*jj the number of pulses in the 
fault-tolerant circuit built from unprotected gates, and 
by L^Y) the number of pulses in the fault-tolerant circuit 
built from DD-protected gates, and suppose that each 
DD-protcctcd gate uses N pulses, while each unprotected 
gate uses a single pulse. Then the ratio 



L 



DD 



r * 



N 



logjilo/r]) 
\og{riQ / t]bd) 



(62) 



is independent of the size L of the simulated circuit. 
If using DD substantially improves the effective noise 
strength, may be small, especially if j] is only 

slightly below the threshold value 770. Even though each 
DD-protected gates requires multiple pulses, the total 
number of pulses used in the simulation may be reduced, 
because DD improves the gate accuracy. 

Of course, we have reached this conclusion using the 
local-bath assumption, which allows us to assign a well- 
defined effective noise strength to the DD-protccted gate. 



Another drawback of this analysis is that our model of 
qubit preparations and measurements may be unrealistic 
in some physical situations. In our estimates of the effec- 
tive noise strength in a DD-protected quantum computa- 
tion, we have treated preparations and measurements like 
gates. We have assumed that each preparation and mea- 
surement location in the circuit, like each gate location, 
has duration tQ. A DD-protected preparation location 
consists of a preparation taking time i5 followed by a DD 
memory sequence, and a DD-protected measurement lo- 
cation consists of a DD memory sequence followed by a 
measurement taking time S. Thus we have assumed that 
the preparations and measurements are just as fast as the 
pulses. In some physical systems, however, preparations 
and measurements are relatively slow; in solid-state de- 
vices, for example, the measurement time can be orders 
of magnitude longer than the gate time. 

If the actual time S required for a preparation or mea- 
surement is longer than the pulse width S but still short 
compared to the pulse interval tq, then we could still try 
to improve measurements and preparations using DD se- 
quences. If it makes sense to model the noise during 
a preparation or measurement as we have modeled the 
noise in the pulses, then we could modify our analysis 
by using the measurement width S in estimating the ef- 
fective noise strength ?7dd at preparation and measure- 
ment locations, while continuing to use the pulse width 
S in estimating 77DD at gate locations. But if i5 > tq, 
or more generally if the noise in preparations and mea- 
surements is modeled much differently than the noise in 
gates, then it may be more appropriate to consider the 
preparation/measurement noise strength to be a separate 
parameter in the analysis, not necessarily related to the 
parameters J and e that characterize the noise Hamilto- 
nian described in Sec. |TT] and appear in the estimate of 
77DD at gate locations. 

Measurement locations might be much noisier than 
gate locations because gates can be improved using se- 
quences of fast DD pulses, while slow measurements can- 
not be improved by DD. Or measurements might be nois- 
ier than gates for other quite different reasons. Previ- 
ous work has shown that scalable fault-tolerant quantum 
computing is still possible, and that the accuracy thresh- 
old is not much affected, when measurements are much 
slower than gates [i^. What deserves further study, 
though, is how fault-tolerant circuit design can be opti- 
mized when measurements are much noisier than gates. 
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VI. EXAMPLES 

Now we will analyze the effectiveness of several dif- 
ferent DD pulse sequences, applying the results from 
Sec. IV Al We adopt a noise model that includes only 
single-qubit errors acting on the system; thus the noise 
Hamiltonian is 

H = HB+Y.a2^<S>B2\ (63) 

i.a. 
(i) 

where i labels the qubits, aa for a — x^y^ z are the Pauli 
operators acting on qubit and 

Hb=Is®Bo. (64) 

In some realistic situations, such as electron-spin qubits 
interacting with a nuclear spin bath such 
single-qubit errors are the dominant noise in the system. 

In principle, -ffcn- could also contain errors that act 
collectively on several qubits at once; for example, errors 
acting jointly on two qubits might be expected to occur 
during the execution of a two-qubit gate. Efficient DD 
pulse sequences can be constructed that suppress multi- 
qubit errors (sol . [5l| . but in this Section we limit our 
attention to single-qubit noise and pulse sequences that 
combat it. The more general results in Sec. IV Al can also 
be applied to other models that include multi-qubit noise 
and to the corresponding pulse sequences that achieve 
first-order decoupling for such noise. 

We will discuss three different DD pulse sequences that 
can suppress the single-qubit noise. The first is the sim- 



plest DD scheme that protects against arbitrary single- 
qubit errors. The second is a time-symmetric sequence 
that achieves second-order decoupling in the limit of zero- 
width pulses. The third is the Eulerian DD scheme 0, 
which is more robust against pulse errors than the other 
schemes. 



A. Universal decoupling sequence 

The shortest pulse sequence that suppresses arbitrary 
single-qubit errors is called the "universal decouplin g se - 
quence" [1, [H, or "XY-4" in the NMR literature |53 ■ 
For this sequence, the unitary operator generated by the 
control Hamiltonian, acting on a single qubit, can be ex- 
pressed as 

Uc{tT>D) = ZIXIZIXI. (65) 

The notation in Eq. (|65p is meant to convey that one com- 
plete cycle of the memory sequence contains four equally 
spaced pulses (each of width 5) that successively apply 
the Pauli operators X, Z, X, Z, where X = Cx and 
Z = az\ therefore the product of the four Pauli opera- 
tors is —1. Each I in Eq. represents trivial evolution 
during the pulse interval of width tq — (5. The total du- 
ration of the pulse sequence is toD = 4ro. 

This sequence achieves first-order decoupling. In the 
limit of zero- width pulses, the toggling frame Hamilto- 
nian is 



\H1 = Hb + X (gi Bx + Y (gi By + Z ( 
XHX = Hb + X ® Bx - Y ® By - Z ( 
YHY = Hb - X (E) Bx + Y (E) By - Z d 
ZHZ = Hb - X (E) Bx -Y (E) By + Z 



) Bz for t e [0,To), 

5 Bz for t e [to,2to), 

Bz for t e [2to,3to), 

Bz fort e [3To,4ro), 



(66) 



and wc find 

f^i(tDD) = -M dtH{t) 

= -iTa {im + XHX + YHY + ZHZ) 
= -UbuHb, (67) 

a pure-bath term. The first-order Magnus term (up to 
the factor —Hbd) is the Pauli-group average of the noise 
Hamiltonian H, which commutes with any nontrivial 
Pauli operator acting on the system qubit. 

In a DD-protected gate, the final pulse in the universal 
decoupling sequence is modified by combining with the 
gate pulse. For a single-qubit gate, the pulse sequence 



realizing the gate G is 

U^(to) = iGZ)lXIZlXl, (68) 

where now GZ represents a single pulse with duration 5 
and to = ^DD- In a two-qubit gate, the universal pulse 
sequence is applied in parallel to both qubits, except that 
the final pulse Z (E) Z in the memory sequence is replaced 
by the two-qubit pulse G{Z (E) Z). 

To estimate the effective noise strength 77dd, we note 
that the total number of pulses is = 4, and that 
To /to = 1/4. From the bounds in Eq. ([SD]) and Table 
U] (for the case where the sequence is not time symmct- 
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ric) we obtain 



Vbd = (4Jto) 



5 
To 



1 
2 
11 

y 



(4ero) + 



9.43 (4ero) 



(69) 



where we have used Ci = NS/Iq because the pulses are 
regularly spaced in time. Note that the parameters J 
and e include sums over all qubits in the set Qa that 
participate in the gate at location a in the circuit. 



universal decoupling sequence 
-time— symmetric sequence 



ero =0.0711 




0.06 



0.08 



FIG. 5: (color online) Plot of tjdd/?? versus ero for the univer- 
sal decoupling sequence (Eq. I69p and for the time-symmetric 
sequence (Eq. I74p . assuming zero- width pulses. The noise 
strength for the DD-protected gate is weaker than the noise 
strength for the unprotected gate for ero < 0.0711 in the case 
of the universal decoupling sequence, and for ero < 0.0403 in 
the case of the time-symmetric sequence. For ero sufficiently 
small, using the time-symmetric sequence reduces the noise 
strength further than the universal decoupling sequence. 

In Fig. [SI rjuu/il (where i] = Jtq) is plotted as a func- 
tion of ero, in ttie limit S/tq — >■ 0. The noise suppression 
threshold condition 77DD < ?7 is satisfied when 



ero < 0.0711. 



(70) 



In the limit ero — > 0, the noise suppression threshold 
condition is satisfied for 

A < i m. 

ro 4 



B. Time-symmetric sequence 

We can construct a time-symmetric DD sequence by 
composing two copies of the universal decoupling se- 
quence — first we perform the sequence in the forward 
direction, and then run it backwards in time. For zero- 
width pulses, using the same notation as in Eq. (|65p . in 



which I represents trivial evolution for time ro between 
successive pulses, this sequence can be expressed as 



DD 



ixizixnxizixi, 



(72) 



where we have combined the two Z operators in the 
middle into a zero-width identity "pulse" [not shown in 
Eq. ([7^ ]. The total duration of the pulse sequence is 
^DD = 8ro, twice as long as the universal decoupling 
sequence. Like the universal decoupling sequence, this 
sequence achieves first-order decoupling. In addition, it 
satisfies the time-symmetry property Uc{tuD—t) = Uc{t), 
so that the toggling-frame Hamiltonian obeys i?(tDD ^ 
t) = H{t), and thus this pulse sequence achieves second- 
order decoupling as well. This pulse sequence is known 
in the NMR literature as "XY-8" [13]. 

For finite-width pulses, we modify our notation to em- 
phasize that the second half of the sequence is the time 
reverse of the first half. We write 

UcitDD) = ix'--hz'--hx'--hisix'-+hz^+hx'-+h. 

(73) 

Now, each I represents trivial evolution for time (ro — 6). 
The Is operator in the middle represents trivial evolu- 
tion for time S, arising from combining two Z pulses. 
It might seem more natural to use I25 instead, match- 
ing the total duration of two Z pulses each with width 
6, but we choose the sequence Eq. (|73p so that our up- 
per bound on 03(T), the dominant Magnus term when 
6/to is negligible, will have a simple form. Since (5/iDD 
is small anyway, it does not matter much which of these 
sequences we choose. X^^^ and Z^^'' represent finite- 
width pulses implementing X and Z. Before the mid- 
point of the sequence at t = tuD/2, the X pulses are 
executed using the constant Hamiltonian Hp^ such that 
X = exp(—i(5 iJp^ ) and the Z pulse is executed us- 
ing Hp^ such that Z = cxp{—i5Hp^), assuming the 
pulses are perfectly rectangular. After the midpoint, 
the universal decoupling sequence runs backwards; X 
is executed using —Hp^ and Z using —Hp^. Thus, 

UcitDD -t) = Uc{t). 

Appending the gate pulse to this memory sequence 
breaks the time symmetry, which can be nearly restored 
using the trick explained in Sec. IIV CI The region A in 
which the time symmetry is violated is the union of two 
intervals: the duration of the gate pulse, and its image 
under time reversal, during which evolution is governed 
by the Hamiltonian —Hp- Thus A = 2(5 (recall that we 
use A to denote both the region and its size). The DD- 
protcctcd gate contains N = 8 pulses (seven pulses in 
the memory sequence, including the identity pulse in the 
middle, plus the gate pulse) and has duration to = 8ro, 
so that To/to = 1/8 and T = to + 5. From the bounds 
in Eq. ([50)) and Table [Ij (for the case where the sequence 
is nearly time-symmetric) we obtain an estimate of the 
effective noise strength t^dd of the DD-protcctcd gates; 
we may use Ci = NS/T < S/tq because the pulses are 
regularly spaced in time. 
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In the limit of zero- width pulses ((5/to 0), the effec- 
tive noise strength becomes 



flDD = (8Jro) 



9.43(8ero) 



(74) 



'7dd /?7 is plotted in Fig. [S] The noise suppression thresh- 
old condition t^dd < ^7 is satisfied when 



ero < 0.0403 



(75) 



This condition is more stringent than for the universal 
decoupling sequence, which is not surprising since the 
time-symmetric sequence is twice as long. As Fig. [5] 
illustrates, the time-symmetric sequence becomes more 
advantageous when ero is small, as is likely to be the case 
when 77DD is below the accuracy threshold 770- In the limit 
ero — ?> 0, only Ci survives, and we find 77DD < 877((5/to); 
thus the noise suppression threshold condition is satisfied 
for 



6 1 



(76) 



The largest permissible pulse width is half as large as in 
the case of the universal decoupling sequence [Eq. fH} ] 
because the time-symmetric sequence is twice as long. 

Using Eq. (p^ and the expressions for 77DD in Eq- (ESI) 
(with 5/ To ~ 0) and Eq. ([74)) . we plot in Fig. [6] the ra- 
tio L5jjj-|/L*jj versus ero foi' both the universal decoupling 
sequence and the time-symmetric sequence. Here, just 
to illustrate the idea that DD can drastically reduce the 
overhead requirements for fault-tolerant quantum com- 
puting, we have assumed 770 /r] = 2, and we have taken the 
value a = log2(291) w 8.18 from ^ (291 is the number 
of locations, including measurements and preparations, 
contained in the fault-tolerant CNOT gadget constructed 
in [HI). Because the noise strength for the unprotected 
gate is close to the threshold value, and because ero is well 
below the noise suppression threshold for each DD se- 
quence in the range plotted, the reduction in the number 
of pulses achieved by using DD-protected gates is sub- 
stantial. Furthermore, although the time-symmetric se- 
quence is longer than the universal decoupling sequence, 
the time-symmetric sequence reduces the total number 
of pulses more effectively than the universal sequence, by 
more than an order of magnitude for ero < 10~^. 

In brief, the overhead improvement achieved by DD, 
illustrated by Fig. [51 arises as follows. The accuracy 
threshold analysis and overhead estimate in [3l| is based 
on concatenated coding, a hierarchy of codes within 
codes. The number of coding levels k needed to simulate 
accurately a circuit of fixed size varies with the effective 
noise strength fj according to 



2'= oc 



1 



log (7^0/77) ' 



(77) 



and the number of noisy gates used in the fault-tolerant 
simulation grows like 2°'^. By improving the effective 



noise strength, DD reduces the number of levels needed, 
substantially reducing the overhead cost. This savings 
in the number of gates more than compensates for the 
additional pulses used to achieve the DD improvement 
of each gate. 
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FIG. 6: (color online) Plot of Z/Do/iun (Eq. (f62]l ') versus ero 
for the universal decoupling sequence and the time-symmetric 
sequence, in the limit S/tq — 0. We have assumed rjo/ri — 2, 
and have used the value a = log2(291) ~ 8.18 appropriate for 
the fault-tolerant gadget constructions in [sil ]. 

For some noise models, the value of 7700 derived by 
our general arguments may be overly pessimistic. For 
example, using the time-symmetric sequence Eq. (j72[) . 
we computed fl3{T) for a single-qubit system coupled to 
an 7T.-spin bath in an external magnetic field, assuming 
an isotropic Heisenberg interaction between the system 
qubit and each bath spin. The ratio of the bound from 
Eq. §U\) and Table U to the actual value of ||f^3(T)|| for 
this model is plotted in Fig. [7] as a function of (Stq, for 
Hg = and (5 = 0. The bound is larger than the actual 
value by at least a factor of 20. 



C. Eulerian decoupling sequences 

The effects of finite pulse width and other pulse imper- 
fections can be suppressed by using an "Eulerian" mem- 
ory sequence Q- In Eulerian decoupling, the operator 
applied by each pulse is the generator of a finite group, 
and Uc{t) traverses an Euler cycle in the Cayley graph of 
this group. As a result, the error Hamiltonian is group 
averaged and first-order decoupling is maintained even 
when the pulses have (reproducible) imperfections. We 
will describe a simple Eulerian memory sequence here; 
see for a more general discussion. 

A simple Eulerian memory sequence protecting against 
single-qubit noise is 



Ucituu) ^ XIZIXIZIZIXIZIXI. 



(78) 
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FIG. 7: (color online) Plot of H^a llBound/llfis lUctuai versus 
/3ro for the time-symmetric DD sequence Eq. (|72[) . The noise 
Hamiltonian is H — Hb + Hsb [Hg = 0), where Hb ~ 
(/3/2) -^f and Hsb = (J/4) E.=.,,.. 
index i labels the bath spins. Here \\il3 
using Eq. ((50]) and Table U (where 5 = and T = 8ro), while 
I 1 1 Actual is computed by evaluating the integral in Eq. (|28|) 
exactly. The kinks in the plots arise because the operator 
norm can have a discontinuous first derivative when eigenval- 
ues cross. 



Here the pulses are equally spaced in time; each I oper- 
ator represents the same time interval, and the spacing 
between the start of two consecutive pulses is tq. This 
sequence looks like two repetitions of the universal de- 
coupling sequence, except that the X and Z pulses are 
swapped in the second repetition. In contrast to the time- 
symmetric sequence Eq. (|73p . we use the same Hamilto- 
nian Hp^ to execute each X pulse, rather than reversing 
the sign of the Hamiltonian during the second half of the 
sequence; similarly we use the same Hamiltonian Hp^ to 
execute each Z pulse. 

Without making any assumption about the pulse 
widths or shapes (except for assuming that all X pulses 
are alike and that all Z pulses are alike) , we may express 
the unitary evolution operator over the pulse interval of 
duration tq as ux(t) for an X pulse and uz(t) for a Z 
pulse. Then, for t G [0,iDD = Stq], Uc{t) becomes 







t e 


[0,ro) 


uz{t - 


ro)X 


t e 


[to,2to) 


ux{t - 


2To){iY) 


t e 


[2ro,3ro) 


uz{t - 


'Sro){-Z) 


t e 


[3to,4to) 


uz{t - 


4ro)(-I) 


t e 


[4to,5to) 


ux{t - 


5ro)(-Z) te 


[5ro,6ro) 


uzit - 


Qro){iY) 


t e 


[6ro,7ro) 


. uxit- 


7to)X 


t e 


[7to,8to) 



(79) 



The first-order Magnus term r2i(tDD) can be expressed 
in terms of effective Hamiltonians Hx and Hz , obtained 
by averaging the Hamiltonian over an X or Z pulse re- 



ToH 



dt u\{t)Hux{t), 

pro 

ToHz= / dtu\{t)Huz{t). 
Jo 



(80) 



Since ux and uz act only on the system, they commute 
with the bath Hamiltonian Hp ; while averaging over the 
pulse alters 7?orr, it has no effect on Hp. Therefore we 
find that 

niituu)^ f°°dtH{t)= r° dtUl{t)HUc{t) 
Jo Jo 

= To {Hx + XHxX + YHxY + ZHxZ) 
+ To {Hz + XHzX + YHzY + ZHzZ) 
^SHpTo; (81) 

thus fli{tuu) is a pure bath term. To derive the last 
line of Eq. ([5T|) . we have used the property H + XHX + 
YHY + ZHZ ^ 4Hp [as in Eq. §7}]. We conclude that 
first-order decoupling is perfectly attained irrespective of 
the pulse shape, as long as the same ux{z){t) is applied 
for every X{Z) pulse, and the integrated pulses are ex- 
actly right. 

To demonstrate the advantage of using an Eulerian 
memory sequence, let us compare it with the univer- 
sal decoupling sequence, taking into account finite pulse- 
width effects. The effective noise strength of the uni- 
versal decoupling sequence is given in Eq. (I69p . For the 
Eulerian decoupling sequence described in Eq. (|78|) . the 
effective noise strength is given by a similar expression, 
but with 4to replaced by 8ro to account for the longer 
Eulerian sequence {N — 8). Furthermore, in this case 
we can drop the first-order, pulse-width dependent term 
6/to, which gives 



?7edd = (8Jro) 



(8ero) + -(8ero) 



9 



-|-y(8eTo)3-t-9.43(8ero)^ 



(82) 



The comparison between the universal decoupling se- 
quence and the Eulerian decoupling sequence is illus- 
trated in Figs. [5] and [HI with numerical values delineat- 
ing different regions easily deduced by solving the corre- 
sponding inequalities comparing Eqs. ([69]) and ([82]). 

Adding a gate pulse G, by combining G with the final 
X pulse of the Eulerian memory sequence, introduces an 
error depending on the width of the final pulse. How- 
ever, because this nonvanishing contribution to Oi (T) 
arises only from the final pulse, it does not depend on 
the length of the memory sequence. Other contributions 
to n(T) that depend on pulse shapes, in the second order 
of the Magnus expansion and beyond, are suppressed by 
additional factors of etq. 

The contributions that depend on the pulse shape can 
be further suppressed by making the Eulerian memory 
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FIG. 8: (color online) Comparison of effective noise strengths 
rjoD and 77EDD for the universal decoupling sequence given in 
Eq. (|65|l (for different pulse-widths) and the Eulerian decou- 
pling sequence given in Eq. (|78p . respectively. The univer- 
sal decoupling sequence is always worse than no decoupling 
{r] = J To) for 5 /to > 1/4, and Eulerian decoupling is worse 
than no decoupling for ero > 0.0239. The Eulerian sequence 
is always better than universal DD for 5/to > 0.1983. For 
smaller values of S/tq, as the pulse-width increases, the Eu- 
lerian sequence outperforms the universal sequence for small 
values of ero. However, because of its longer length, the Eule- 
rian sequence offers no advantage over the universal sequence 
or no decoupling when ero is too large. 




0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 



FIG. 9: (color online) Illustration of the parameter regions 
in which no dynamical decoupling, the universal decoupling 
sequence (DD), Eq. (|65p . or the Eulerian decoupling sequence 
(EDD), Eq. (|78p . emerges as the best strategy. Different re- 
gions indicated correspond to the following inequalities: (1) 

'7EDD < '7noDD < T^DD; (2) r]EDD < T^DD < T/noDD; (3) 

rjDV < rjEOD < r}noDV, (4) rjDU < ??noDD < JJbdd; (5) 
??noDD < Vdd < 77edd; (6) T^noDD < Vedo < J?DD. The uoise 
strengths are given by 77noDD = Jro and Eqs. (f69)) . (f82)) . 



sequence time-symmetric. Consider, for example, the se- 



quence 

Ucituu) 

= x^-hz^-hx^-hz^-^iz'--Hx'--Hz^-hx^-H 
X ix^+hz^+hx'^+hz^+hz^+hx^+hz^+hx^+l (83) 

where the control Hamiltonian is chosen such that 
"x(-)(*DD-i) = ux{+)it) and Uz{-)ituB - t) = Uzw{t)- 
Because this sequence obeys the time symmetry condi- 
tion [/c(tDD ^ ^) = Uc{t), the even-order Magnus terms 
vanish. Furthermore, because Eq. (j83p is just two copies 
of the Eulerian sequence Eq. ([75)) . the first running 
backward in time and the second running forward, the 
sequence achieves first-order decoupling for any pulse 
shape. Corrections depending on the pulse shape en- 
ter only in third order and beyond. Of course, making 
the Eulerian sequence time-symmetric (or making the 
time-symmetric sequence Eulerian) lengthens the pulse 
sequence and so increases the time T appearing in the 
Magnus expansion. Whether using this longer sequence 
actually improves the noise suppression depends on the 
values of the parameters etq, Stq and tq/Iq, but it could 
pay off if the pulse width is relatively large, as suggested 
by Figs. [5] and [SI Adding a gate pulse to the time- 
symmetric Eulerian memory sequence spoils the first- 
order decoupling and breaks the time symmetry, but the 
resulting contributions to ili and fl2 depend only on the 
width of the final pulse, not on the length of the pulse 
sequence. 

Eulerian DD-protected gates that achieve exact first- 
order decoupling for nonzero-width pulses can be de- 
vised using the dynamically corrected gates recently in- 
troduced in [s^ . [in . This scheme is based on the idea 
that distinct gates can have related errors, so that the er- 
rors cancel for a suitably constructed pulse sequence. The 
errors in distinct gates can be similar if the gates are con- 
structed from control unitaries that traverse similar time- 
dependent paths, differing only by rescaling or reversing 
the time along the path. Arbitrary-order decoupling for 
nonzero-width pulses can be achieved by concatenating 
dynamically corrected gates [56| . 



VII. DERIVATIONS 

In this section, we derive the coefficients for the bounds 
on the Magnus expansion listed in Table HI The Magnus 
expansion is computed for the Hamiltonian HM(t) given 
in Eq. (g^l); at any time t, HM{t) = ±HB + H'{t), where 
H'{t) is either or iforr(i)- The two terms in HM{t) are 
bounded as \\Hb\\ < P and \\H'{t)\\ < J; thus ||^fAf(t)|| < 
/3 + J = e. The Magnus terms can be cornputed from 
Hnj{t) using the following recursive formulas [57[, derived 
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in Appendix El 









(84a) 




/ dtA{t); 
Jo 




(84b) 




Ef / dtSi^'i^)' 


n>2; 


(84c) 


sip it) = 


[nn-i{t),A{t)]; 




(84d) 


siPit) = 


n—j 

J2 [n„.it),si=-p{t) 

m—1 


,2<j<n- 


-1, (84e) 



where {Bj} are the BernouUi numbers. Exphcit formulas 
for n2{T) and flsiT) were given in Eqs. ^ and (^5]) . 



A. General case: Magnus expansion 

For the general (i.e., not time-symmetric) case, Table|I] 
gives Ci = N6/T for regularly spaced pulses or 2N6/T in 
general, Cs = 1/2, C3 2/9, C4 = 11/9 and C5 = 9.43. 
Now we derive these coefficients. 



1. Bound for Q'l 

We assume that first-order decoupling is attained, so 
that in the limit of zero- width pulses H{t) for the memory 
sequence satisfies Eq. ([HI]): J^°° dtHe„fi{t) = 0. Recall 
that the subscript "0" on ifcrr means we are to take 6 
to zero in _ffcrr(i) while holding tq fixed. If a zero- width 
gate pulse is appended to the memory sequence, then 
Hci-i.o{t) in the DD-protectcd gate differs from Hci-r,o{t) 
in the memory sequence only during the final instanta- 
neous pulse, and therefore still integrates to zero. Hence 
the DD-protected gate as well as the memory sequence 
satisfies rJi(T) = -iTHe and n[{T) = 0. 

When the pulses have finite width {S > 0), il'i picks up 
corrections that depend on 6. Noting that H{t) differs 
from Ho{t) only during the pulses, we write 



n[{T) = dtH{t) + HqHb 

Jo 

pit) 

= ~i dtHo{t) + UoHb 
Jo 

+ i dtp^^Ho{t) — i / dtpwH{t) 
Jo Jo 

fio pio 

= 11 dtpwHo{t)-i dtpwH{t). (85) 











Here, dtp-w indicates integration only over times within 
the pulses. Now, Ho{t) = Hb + Hcirfi{t), so for a se- 
quence with N pulses (including the gate pulse), we 
have i Jq° dtpwHo{t) = iNSHb + i Jq° (iipw-ffcrr,o(0 ^-^id 



-i J*°dtpwH{t) = -INSHb ~ i J*° dtpwHcrr{t). The 
two iNSHb terms cancel, and we are left with 

pi(i pto 

n[{T) = i dtpwHcrrAt) - i dtpwHcnit). {86) 
Jo Jo 

The second term can be upper bounded by NSJ. For 
the first term, Eq. tells us that for 6 = 0, Ho{t) = 
jjik) _ jj^ ^ g [sfe,Sfc+i). Hence, we have 

that i /g" (itpw-fferr,o(0 = iSJ2k ^o^P Now, the first- 
order decoupling condition can be written as 

' dm„,o = E(sfe+i - = 0. (87) 



/ 

Jo 



If all the pulses are regularly spaced in time, so that 
Sfc+i — Sfc are all equal for all k, this condition implies 

~ (k) 

that -f^err = 0. In this case, the first term of the right- 
hand side of Eq. ([55]) vanishes and il'i (T) is bounded by 
the norm of the second term only: 

\\n',iT)\\<NSJ=^{JT). (88) 

Hence, Ci = NS/T if pulses are regularly spaced in time. 
Even if the pulses are not regularly spaced in time, this 

value of Ci works whenever J2k ^err = 0. Otherwise, we 
can still upper bound the first term in Eq. by NSJ, 
so that \\n[{T)\\ < 2NSJ = {2N5/T){JT). This gives 
Ci = 2N5/T in general. 



2. Bound for Q.2 

We will derive an upper bound on 

n2{T) = -]-( ds^ r ds2[HM{si),HM{s2)] (89) 
^ Jo Jo 

where HM{t) = Hb + H'{t). The term quadratic 
in Hb vanishes, because Hb is time independent and 
[Hb,Hb] = 0. The term of linear order in Hb can be 
expressed as 

-If dsi r ds2[HB,H'{s2)-H'{si)]. (90) 
^ Jo Jo 

Lat 

pT ps\ pT pT 

I dsi / ds2 = ds2 / dsi; (91) 

Jo Jo Jo Js2 



We note that 



either way we are integrating over the triangle with S2 < 
si < T. Therefore, 

/ dsi / ' ds2 [Hb,H'{si)] = [ ds s[Hb,H'{s)], 
Jo Jo Jo 

[ ds2 [ ds, [Hb,H'{s2)] 

Jo J S2 

= [ ds {T-s)[Hb,H'{s)]. (92) 
Jo 
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Combining terms we find 



f dsi f ' ds2 [Hb,H'{s^)-H'{s2)] 

Jo JQ 

= r {2s-T)[Hb.H'{s)1 
Jo 



(93) 



and hence 



^ Jo Jo 

F 1 

< J/3 / ds\2s-T\ = -J^T^. 
Jo 2 



(94) 



This bound on the sum of two terms is better by a fac- 
tor of two than we would have found by bounding the 
two terms separately, because of a partial cancellation 
between the two terms. 

We bound the term in fi2(T) of zeroth order in He 
using 



[H'{s,),H'{s2)]\\<2J^ 



(95) 



and therefore 



\ [ dsi r ds2[H'{sr),H'{s2)] 
^ Jo Jo 

< i(2j2)(rV2) = ij^T^. 



(96) 



Combining with the terms linear order in Hb we obtain 
\mT)\\ < ^JpT^ + ij^T^ = i(er)(JT), (97) 
where e = /3 + J; hence C2 = 1/2. 



3. Bound for fls 
The integrand in the expression Eq. for ft3{T) is 

'-{[HMisi),[HM{s2),HM{s3)]] 



+ [Hm{s3), [Hm{s2),Hm{si)]]), (98) 

where Hm{s) = Hb + H'{s); because [Hb,Hb] = 0, the 
term cubic in Hb vanishes, and the terms quadratic in 
Hb can be written in the form 

'-[Hb, [Hb,H'{s,) + H'iss) - 2H'{s2)]]. (99) 
The time-ordered integration 



dsi / ds2 / dss 
Jo Jo 



(100) 



can be expressed as dsi (s^/2) for a fimction indepen- 
dent of S2, S3, as ds3 ^(r — 53)^ /2^ for a function in- 
dependent of si, S2, and as ds2 S2 (T — S2) for a func- 
tion independent of si, S3. Therefore, the contribution to 
^siT) quadratic in Hb is 



ds[HB,[HB,H\s)]] 

X Qs2 + i(r-s)2-2s(r-s; 



quadratic 



using 

||[i/B,[i?s,ff'(s]]|| <4/3V, 
it can be bounded as 



(101) 



(102) 



< ds 



l/- + ^{T-sf-2siT-s) 



3 3^/3 9\/3 



{(3Tf{JT). 



(103) 



(The integrand has zeros at s± = i ± it is positive 
in [0,s_] and [s+,r], negative in [s_,s_|-]. The integrals 
over these three intervals are respectively jf^yf' 
-^^—!=, and the integral of the absolute value is ^7=.) 

12V3' ° 3\/3 ' 

Now consider the terms linear in Hb , with integrand 



-([523] + [S21] + [1S3] + [351] + [12B] + [32B] 
6 



where 

[i?23] = [ifB,[H'(s2),i/'(s3)]], 

etc. We note that 

dsids2ds3([lB3] + [12B] 

T>,Si>,S2>S3>0 

dsidsa (si - S3)[1B3] 



(104) 
(105) 



T>si>,S3>0 



dsids2 S2[1S2] 



T>si>S2>0 



T>si>s2>a 



dsids2 (si - 2s2) [IB2], (106) 



and hence 



/ 

Jt: 



T>Si>S2>S3>0 

< \\[^B2]\U 



dsids2ds3{{lB3] + [12B]) 
dsids2 \si — 2s2 



T>si>S2>0 



6 3 



(107) 
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Similarly, 



/ 

Jt>si 



>S2>S3>0 



dsids2ds3{[3Bl] + [32B] 



T>si>S3>0 



dsids3 (si - S3)[3Bl] 



ds2ds3 (T- S2)[3B2] 



T>S2>S3>0 



dsidss (2si-S3-r)[3Bl], (108) 



IT>si>S3>0 

and hence 



dsids2ds3{[3Bl] + [325]) 



T>Si>S2>S3>0 

< i|[3Bl]|lmax 

X / dsids3 |(r-s3)-2(r-si)| 

JT>T~S3>T-si>0 



7^3 2 

6 3 



(109) 



Also, 



/ 

Jt>si 



>S2>S3>0 



dsids2ds3{[B2-i\ + [S21]) 



T>S2>S3>0 



ds2ds3 (r-s2)[B23] 



T>si>S2>0 

dssdsa (r-S2-S3)[S23], (110) 



/T>S2>S3>0 

and hence 



/ 

Jt>si: 



>S2>.53>0 

< II [523] I 

T>S2>S3>0 

6 3 



dsids2ds3([S23] + [B21]) 

ds2ds3 IT-S2-S3I 



(111) 



Combining these three bounds, we obtain an upper 
bound on the terms in n^iT) linear in Hb- 



ll[^^3(r)],_J|<^x3x (^/3J2t-^ 



1 



(112) 



For the term in ^.^{T) independent of Hb, we have 

l|[^^3(r)],_,, 

—order 1 1 

1 fT^\ 



<-(^— j (2)11 [H'{s,),[H'{s2),H'{s3)]] II 
liJTf. 



Putting together the bounds on the terms of second, first, 
and zeroth order in Hb, we find 

||Oj(r)|| < ^(flTfiJT) + \(flT){JTf + liJTf 



(114) 



Using J < e, we obtain a weaker but simpler bound 

2 



||f^3(T)|| < lieTfiJT). 



(115) 



Hence C3 = 2/9. 



4- Bounds for Sl„>4 



To bound the Magnus terms for n > 4, we use the 
recursive formulas Eq. (|84p and ideas from [5^, [5^ . In 
Appendix lEl we show that the Sn ^ operators satisfy: 



\\si^Hm<fi'^j{2etr-' 



(116) 



(J) 



for all n > 2. 1 < j < n — 1, where the coefficients /, 
are given in Eq. (|E2p . Using this, we can write down 
bounds for fln>4 as follows: 

\\nn{T)\\<J2i-^ / ds\\S^^\s)\\ 

n ^ — ^ 1 1 



n ^ — ' 7 



= /„(jr)(4eT)"-i, 
where the coefficients /„ are defined as 



(117) 



(118) 



Using Eq. (jllSp and the recursive formula for fn'^ from 
Eq. (Hil, one can show that fi = 11/576. Then, fi4(r) 
can be bounded as 



|f^4(r)|| < ^{JT){4eTf, 



(119) 



so C4 ^ 4^(11/576) = 11/9. 

The bounds for f2„ for n > 5 can be gathered together 
into a single bound by writing 



(113) £ II a. (T) II <(JT) (4er)^ 



n— 5 



n — 5 



.71 — 5 



(120) 
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In [531, the {fn} were shown to be coefRcients in the 
power series expansion of G~^{y) = fnU^', G~^{y) 

is the inverse function of 



y = G{s) 



dx 



(l-eot|) 



(121) 



defined for domain — 27r < s < 2tt, the interval over which 
G{s) is monotonically increasing. A self-contained proof 
of this fact is provided in Appendix |F1 We want to relate 
the expression in the brackets in Eq. (|120p to G^^. Define 
C as 

C = G(27r) = 2.17374..., 6-^(0 = 27r, (122) 

and assume that eT < 0.54 so that 4eT < (. Then, 
G~^{4:eT) < 27r since G{s) is monotonically increasing 
over its domain, and therefore, 



E 

.n— 5 



/n(4er) 



n— 5 



n— 5 



1 



G-^(c)-E/"C'^ 



ri=l 



^ /3 



Using /i = 1, /2 = i, /a = ^ and /4 
can be derived from Eq. (fTTS)) and Eq. (|E2]), E"q. (fT23l) 
implies 



11 

576' 



(123) 



which 



^/„(4eT)"-5 



.n— 5 



< 0.03685. 



Then, 



E 

n— 5 



||f^„(T)|l <C'(JT)(4er)^ 



(124) 



(125) 



Therefore, C5 = 4" x C" ~ 9.43. 

Note that the condition eT < 0.54 is more stringent 
than the sufficient condition for convergence of the Mag- 
nus expansion given in Eq. (j29p . which requires eT < tt. 
If 0.54 < eT < TT, we need to use a different method to 
find an upper bound on the sum of the high-order Mag- 
nus terms. 



B. General case: Dyson expansion 



In Scc lVIlXl we used the Magnus expansion to obtain 
bounds on the noise strength of DD-improved quantum 
gates. Here we derive bounds on the noise strength by 
a different method based on time-ordered perturbation 
theory in the toggling frame. These new bounds are eas- 
ier to derive than those in Sec. IVII Al and they apply 
without any upper bound imposed on the expansion pa- 
rameter eT; furthermore, in the case of a pulse sequence 
that achieves third-order decoupling, they are actually 
tighter than the previous bounds. Unfortunately, in the 



case of first-order or second-order decoupling, they are 
not as tight. In this derivation, we will assume pulses 
have zero width, and we will consider only the general 
case (without time symmetry). 

In the local-bath model, we consider the toggling-frame 
system-bath Hamiltonian 



(126) 



which describes the noise at a particular circuit location. 
Here XHb is the Hamiltonian of the local bath (acting 
trivially on the system) and XHcn is the Hamiltonian 
responsible for the noise (acting jointly on system and 
bath) . We have introduced the coupling parameter A here 
for convenience, to keep track of terms in the Dyson and 
Magnus expansions, and we will set A = 1 momentarily. 

Consider the toggling-frame time-evolution operator 
U{T) obtained by integrating the Schrodinger equation 
with Hamiltonian XH(t) for time T (if the control unitary 
Uc{T) for this time interval is the identity — i.e. if the 
control sequence is cyclic — then the toggling-frame and 
Schrodinger picture evolution operators coincide). The 
Dyson expansion is the expansion of U{T) in powers of 
A: 







/ dh-- 









i)---H{t^)) , 

(127) 

where T denotes time ordering. The Magnus expansion 
is the expansion of the logarithm of U{T) in powers of A: 



[/(T) = expK^A"a„(T) 



(128) 



\n=l 



We say that the control sequence achieves nth order de- 
coupling if the first n terms in the Magnus expansion 
are pure-bath terms, acting trivially on the system. By 
expanding the exponential in Eq. (|128p and comparing 
with Eq. (|127p . we see that for a control sequence that 
achieves nth order decoupling, the terms of order A™ for 
TO < n in the Dyson expansion are all pure bath terms 
[and that the (n-l-l)st-order term in the Dyson expansion 
is A"+^ (ri„-)_i(T) + •••), where the ellipsis represents a 
pure bath term.] 

In Sec. IIVI we defined the effective noise strength 7700 
as an upper bound on the deviation of the noisy operation 
U (T) from a pure-bath unitary operator U b (T) : 



77DD = max||;7(T)-C/B(T)|| 



(129) 



This definition was convenient because each order in the 
Magnus expansion is anti-Hermitian, so that in the case 
where nth-order decoupling is achieved, the exponential 
of the sum of the first n terms in the Magnus expansion 
is a pure-bath unitary. However, when we express the 
noisy unitary as the sum of good and bad parts (where 
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the good part acts trivially on the system), it is not nec- 
essary for the good part to be unitary — the criterion 
for scalable quantum computing is j] < rjo (where rj is 
the operator norm of the bad part) whether the action of 
the good part on the bath is unitary or not. Therefore, 
to estimate the noise strength, we can separate the terms 
in the Dyson expansion into pure bath terms (whose sum 
is not necessarily unitary) and remaining terms that may 
act nontrivially on the system. Then the noise strength 
r] is an upper bound on the operator norm of the sum of 
these remaining terms. 

The operator norm of the nth order term in the Dyson 
expansion Eq. (|127|) can be bounded above by ^(eT)" 
(with A now set equal to 1). This is simply an upper 
bound on the norm of the integrand times the volume of 
the integration region. But we can also do a double ex- 
pansion of the nth order term in Hb and Hcrr, bounding 
each term separately (60j . In this double expansion, the 
terms that are zeroth order in -ffcrr sltc of course pure 
bath terms, and their sum has operator norm bounded 
above by ^(/3T)". Thus, the upper bound on the sum 
of all the order n terms in the Dyson expansion that are 
not zeroth order in J is 



— ((/3 + J)" - n 



(130) 



To express this bound in terms of e, we note that /(/3) = 
/3" is a convex function for n > 1, so that f{/3) > f{l3 + 
J) - Jf{J + P); thus, 

(/3 + J)" - /3" < nJ(/3 + J)"-i = 7iJe"-i, (131) 

and the upper bound in Eq. (jl30l) becomes 

1 



(n-1)! 



{JT){eTy 



(132) 



Now consider a cyclic control sequence that achieves 
nth order decoupling, so that all terms up to nth order 
in the Dyson expansion are pure bath terms. We estimate 
the effective noise strength using an upper bound on the 
non-purc-bath parts of all higher order terms, finding 



??DD 



m— n+ 



1 



1 (™ - 1) 



iJT){eTy 



(133) 



Thus, by using the Dyson expansion rather than the Mag- 
nus expansion we have found, we read off C„ = 1/ (n— 1)! 
for n = 2, 3, 4 from Eq. (fTSS]) . i.e.. 



Co 



1, C3 - 1/2, C4 = 1/6, 



(134) 



and 



00 ^ 

= E 7 ttt(^^) 

(m — 1)! 



m— 5 



m— 5 



e"" -l-x-x^/2-x^/6 



24 ^ 120 720 



x=eT 

0.0466, 



(135) 



where the numerical value of C5 was obtained by setting 
eT = 0.54 in order to have a meaningful comparison with 
the C5 value we obtained from the Magnus expansion. 
Thus, comparing with the bounds derived in Sec. IVII Al 
we have improved the values of C4 and C5 substantially, 
but not the values of C2 and C3. This means that for 
a (not time-symmetric) cyclic control sequence achieving 
third order decoupling, we get a smaller value for ?7dd 
using the Dyson expansion rather than the Magnus ex- 
pansion. 



C. Time-symmetric case 

Now we derive bounds on the Magnus terms that apply 
when the pulse sequence is time-symmetric except inside 
a small region A C [0, T]; as before, we use A to denote 
both this region and its size. As in Eq. (|49p . we are inter- 
ested in the Hamiltonian Hi\i (t) describing evolution for 
time r governed by the Hamiltonian —Hb, followed by 
evolution for time T — T governed by the toggling-frame 
Hamiltonian of a DD-protected gate. But our analysis 
in this Section applies to any Hamiltonian HM{t) that is 
time-symmetric outside region A. 

Even Magnus terms vanish when A = 0, and we will 
derive explicit A-dependent bounds on (T) and Vti{T), 
which are linear in A when A is small. We could also 
exploit the time symmetry to derive improved bounds 
on the higher-order Magnus terms (r2„>5(r)); however 
we will not bother to do so. Instead we use the same 
upper bounds on these terms that apply in the general 
case, with the expectation that these bounds are already 
quite small in typical cases of interest. 

To obtain a bound on 0,2{T) for a nearly time- 
symmetric sequence, we observe that the double time 
integral in Eq. can be split into four cases: (i) 

si,S2 i A, (ii) Si e A,S2 </ A, (iii) si ^ A,S2 e A 
and (iv) si,S2 G A. The contribution from case (i) van- 
ishes, because HM{t) is time-symmetric in this region. 
The contribution from the remaining three cases can be 
bounded by 

\\^2{T)\\ < ^\\[Hm{si),Hm{s2)]\U^- Volume 

< 2Je- Volume, (136) 

where "Volume" means the total volume of integration 
regions (ii), (iii), and (iv) combined. 

We recall that the integral is time ordered, so that 
si > 32- The region A is the union of a disjoint set of 
intervals {A^}. We assume these intervals are labeled 
consecutively, so that A^ > A,; for j > i. For case (ii), 
if si £ Ai, then S2 lies in the part of r\A less then A^. 
Call this region r<i. Similarly, for case (iii), if S2 £ A^, 
then Si lies in the part of T\A greater then A^. Call 
this region T>i. Adopting the convention in which the 
same symbol is used to represent both a region and its 
length, the total integration region for cases (ii) and (iii) 
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combined has volume 

A« (r<,) + A, (T>,) = Y.A, (T<, + r>,) 

-^A,(r- A) = A(T~ A), (137) 

with the first contribution coming from case (ii) and the 
second from case (iii). 

For case (iv), if Si and S2 are in the same interval A^, 
the integration region has volume ^A^. If si G A.; and 
S2 ^ Ai, then S2 € Aj for j < i. Summing the volumes 
of all regions with si, S2 G A gives 



i i<j \ i J 



A^. (138) 



Adding the contributions from cases (ii), (iii), and (iv), 
we find that the total volume is AT — ^A^, and conclude 
that fell 

\\^2{T)\\ < 2Je- Volume 

= 2^(^1- A^(JT)er). (139) 

Hence, C2 = 2(A/T) (1 - A/2T). 

Since each even Magnus term vanishes in the time- 
symmetric case, there are upper bound on all even Mag- 
nus terms that depend linearly on A/T to lowest order. 
Such bounds are derived in Appendix |D] For f24(r), 
Eq. dnzl yields 



|5^4(T)|1 < U{JT){eTf 
= U{JT){eTf 



T' 



3 



(140) 



Since 4(A/T)3 < 4(A/T)2 and (A/r)^ > 0, we can 
rewrite this as 



|r!4(T)|| < U{JT){eTf 



A A 
4 2 — 

T \T 



56^ (1 - |; ) {JT){eTf 



(141) 



Hence, C4 = 56(A/r) (1 - A/2r). (For n2{T) the 
bound Eq. (jD7P is actually weaker by a factor of 2 than 
Eq. ()139p . because a looser estimate of the integration 
volume is used to derive Eq. (|D7p .) 



VIII. CONCATENATED DYNAMICAL 
DECOUPLING 



A concatenated DD pulse sequence is a recursively gen- 
erated sequence with a self-similar structure [ifll . For 



example, from the "level- 1" universal pulse sequence 

Pi = ZIXIZIXI (142) 

we obtain the corresponding "level-2" sequence by re- 
placing each pulse interval I in the level- 1 sequence by 
the complete level-1 sequence pi, obtaining 

P2 = ZpiXpiZpiXpi, (143) 

similarly, the level- fc sequence is 

Pk = Zpk^iXpk-iZpk^iXpk-i- (144) 

If the duration of a single pulse is tq and pi is an i?-pulse 
sequence that achieves first-order decoupling, then the 
corresponding level-A: sequence pk has duration T'^'^-' = 
R'^tq and achieves fcth-order decoupling; i.e., has effective 
noise strength 0{Je^). 

The advantages of concatenated DD over standard 
periodic pulse sequences (such as cycles of the uni- 
versal decoupling, or XY-4 sequence) have been docu- 
mented numerically [lol [63 - l65j and confirmed in a num- 
ber of recent experimental studies (66l - l69j . Concatenated 
pulse sequences are substantially less efficient than "op- 
timal" sequences with nonuniform pulse intervals that 
achieve fcth-order decoupling with exponentially fewer 
pulses [H, [H, |4l|, [70l - [72i] . but nevertheless have some 
nice properties. For one thing, concatenated pulse se- 
quences are relatively robust against pulse imperfections, 
because pulse errors arising at each level get suppressed 
at higher levels. Experimental evidence for this robust- 
ness was provided in a recent NMR study of a qubit in a 
rapidly fluctuating spin bath, where pulse imperfections 
played a role, and concatenated DD sequences outper- 
formed a variety of other sequences, including "optimal" 
ones with nonuniform pulse intervals, in preserving an 
unknown quantum state [66j . 

We will analyze the performance of concatenated pulse 
sequences in two ways, first using the Magnus expansion, 
and then in Sec. IX Fl using the Dyson expansion and bath 
correlation functions. 

Before presenting the analysis, we briefly state our 
main results. For ideal, zero-width pulses, we flnd that 
if an i?-pulse sequence is concatenated k times, then the 
effective noise strength is 



W ^ ^fc(fe+3)/2 



(cero) " (Jro) 



(145) 



where tq is the pulse interval and c is a constant of order 
one. Increasing the concatenation level produces higher 
order decoupling, reflected in the /c-dependent power of 
ero in Eq. (jl45|) . but also lengthens the pulse sequence, 
reflected in the fc-dependent power of R. Thus there is 
an optimal concatenation level fc, given by 



fcn 



Uogfl(l/cero) - IJ, 



(146) 



where [-J denotes the "floor" function. Using this opti- 
mal value of fc, we flnd that the optimal effective noise 
strength satisfles the bound 
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If a time-symmetric i?-pulse sequence is concatenated k 
times, then the effective noise strength is 



Vdd 



(148) 



which yields 

VuD'^/iJro) < R-^'^ (cero)'°^«('/"""'~' (149) 

after choosing the optimal value of k. 

Optimal noise strengths for the universal and time- 
symmetric sequences, plotted in Fig. [TUl are orders of 
magnitude lower than the noise strengths achievable 
without concatenation, shown in Fig. [5j Though longer, 
the time-symmetric sequence performs much better when 
ceTo is sufficiently small. 

When the pulses have a finite width 5 and consequently 
experience systematic errors that arise from the time- 
independent noise Hamiltonian that is on during the 
pulses, there is a floor on the effective noise strength, 
namely 



^'"^ > 4:R5J. 



(150) 



As the level k increases, rj'^^ falls as in Eq. (jl45p or 
Eq. (I148P as long as it remains well above the floor, but 
reaches a plateau as the floor is approached. Such behav- 
ior was observed in the numerical simulations reported in 
p,0] . This floor might be substantially suppressed by us- 
ing Eulerian pulse sequences as in Sec. IVI CI 
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universal decoupling sequence 
- time-symmetric sequence 



o Q 




FIG. 10: (color online) Upper bounds on the effective noise 
strengths achieved by the concatenated universal DD pulse 
sequence (blue dashed line, based on Eq. (|147|) with R — 4) 
and by the concatenated time-symmetric DD pulse sequence 
(red solid line, based on Eq. p49|) with i? = 8), as a function 
of cero, where c is defined in the text. 



A. Magnus expansion analysis 

The noise Hamiltonian has an unambiguous decompo- 
sition into two parts: H = Hs+Hcn-, where Hb = I^Bq, 
Heir = J2a ^a, and {Sa} is a basis for the trace- 
less operators acting on the system. For a level- 1 pulse 
sequence with duration T, the toggling-frame time evo- 
lution operator is U{T) = exp(f7(r)); writing n{T) = 
—iH'-^^T, we may regard H'-^'> as the level-1 "effective 
Hamiltonian." Like H, H^^^ has an unambiguous de- 
composition into two parts. 



(1) 



(151) 



and we may deflne parameters that characterize the ef- 
fective noise at level 1: 



(152) 

Now, we can analyze the level-2 pulse sequence just as we 
did the level-1 sequence, but with the level-0 noise Hamil- 
tonian H = H^^^ replaced by the level-1 effective Hamil- 
tonian H^^K Proceeding in this way, we can estimate 
properties of the toggling-frame time evolution operator 
[/('^^ for the level-A: pulse sequence using the level-(fc— 1) 
Hamiltonian H^''~^\ At each level, we can define noise 
parameters /S'-'^-', J'-'^-', and e'-'^'' as in Eq. ()152|) . and derive 
recursion relations that relate the level-A: noise parame- 
ters to level-(fc— 1) noise parameters. 

To understand how first-order decoupling is achieved 
by the level-1 sequence, we assumed that the toggling- 
frame Hamiltonian is constant in the interval between 
pulses. For the concatenated sequence at level 2 and 
above, this assumption is not true, since the interval in 
between the level-A: pulses contains a complex level-(fc— 1) 
pulse sequence. However the unitary operator describing 
the evolution from the end of one level-fc pulse to the 
beginning of the next level- fc pulse is equivalent to the 
evolution operator that would have been derived from 
the constant Hamiltonian i/C^^^) during the pulse in- 
terval. Thus for the purpose of understanding the time 
evolution in the toggling frame resulting from the level-fc 
sequence, it does no harm to imagine that the Hamilto- 
nian is constant between pulses and do the analysis just 
as for the level-1 sequence. 

For a sequence that achieves first-order decoupling, fii 
at each level is a pure bath term 



(153) 



where T'-'^'' = R'^tq is the duration of the level-fc sequence, 
constructed by concatenating fc times a sequence with R 
pulses. Suppose we consider a pulse sequence such that 
each pulse either commutes or anticommutes with each 
of the traceless operators in the set {Sa} (the argument 
below can be easily adapted to more general pulse se- 
quences). Under this assumption, the second-order term 
fl2 in the Magnus expansion has no pure-bath compo- 
nent (sec Appendix [G)) and thus contributes only to Hcj) ■ 
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Therefore H^^^ arises from Jl^'^''' and the pure bath com- 
ponent of 0,^1 = J2'^=3^^'' ■ shown in Appendix iHj 
the norm of the pure-bath component of f2^'^'' is no larger 
than Ijfij'"^ |j ; it fohows that we may choose such that and 
jg upper bound on 



\\^r{T)\\ + \\dii{T)\\. 

From Eq. ([50]) and Table H] we see that 

\\^>l\\ < 4'^ (Z'^-^^rC^)) (e('^-i)r('=))' , (155) 
where the "constant" c'^^ actually depends on the value 



(154) 



I + H (^,(fc-i)T(fe)) + 9.43 ' , (156) 



assuming e^'^-i^rC^) < 0.54 {e.g., 4 ' = 0.44 for 
^{k-i)j^{k) ^ Q i ^{k) ^ Q_24 for e('="i)T('^) = 0.01). 
Recalling Eq. (|153p . we conclude that 

^^(fc-i) (^.(fc-DyW)'. (157) 

Though Eq. (|157p has been expressed as an equality, the 
right-hand side is actually an upper bound on 

The level-fc error Hamiltonian i?crr arises from Q,'"^^ 

and the traceless component of fi>3. It is shown in Ap- 
pendix [H] that the norm of the traceless component of 
fi^'^'' is no larger than 2||r2j*'''||; therefore we may choose 
J^*^) such that J('=)r('=) is an upper bound on 

\\ni''\T)\\+2\\n^^l{T)\\. (158) 

(If the system is a single qubit, then the norm of the 



traceless component of fij'^'' is no larger than and 
thus the factor of 2 in the second term can be omitted.) 
Therefore, again using Eq. ([50]) and Table |I] we find 



(fc)i 



(159) 



where 



+9.43(e('^-i)T(^))' 



(160) 



assuming e^^'^i^rC^) < 0.54 {e.g., 4''' = 0.588 for 
^ 0.1 and 4''^' = 0.505 for e^'^'^lT'^''-^ = 0.01). 
Eq. (jl57p can be rewritten as 

pik) ^ p{k-l) ^ j^(k) ^ (161) 

W=4'=)j(^-i)(e('^-i)rW)', (162) 



where 



K 



and iterating this equation yields 

^C^) =13 + + /\ + . . . + A'(fc) 



Ak) ^ (3(k) + j{k) 



(163) 



(164) 



The solution to the recursion relations Eq. (|159p . p62p . 
(|164p cannot be expressed easily in closed form, but the 
properties of the solution can be grasped if we assume 
that 



Jk)Jk-i) 



< ce 



(165) 



for each k, where c is a constant. That is, if we iterate 
the recursion relations to estimate J^^\ our assumption 
is that Eq. (|165p is satisfied for all k < £. Then using 
y(/c) ^ R^yq, we can replace Eq. (fT59)) by 



=(cero)i?''-j('^-i), 
which has the solution 

jW =(ceTo)"i?'('=+i'/V 



(166) 



(167) 



where = J 

The effective noise strength for the level-fc sequence is 

= ||7jW||r(^-) = j(^-)i?Vo 

= (cero)^ ( Jro), (168) 



so that 



(fc) 



(169) 



therefore the optimal suppression of the noise strength 
is achieved by choosing the level k to be the largest in- 
teger such that i?'^+^(cero) < 1, or equivalently, fcmax = 
Llog^(l/ceTo) - IJ [Eq. 

For example, if cetq = 10 and i? = 4, we choose 
fcmax = 3 {i.e., a sequence with duration T*^^-* = 64ro) 
and obtain 'Uj^y)'^"^^ / {Jto) = 2.6 x 10^'^, an improvement 

by a factor of 60 over the noise strength 7]^^ achieved by 
the level- 1 sequence. 

The expression for 77^^ in Eq. ()168p is the exponen- 
tial of a quadratic function of k, minimized at fc = 
log^(l/cero) — 3/2. The nearest integer differs from this 
optimal value by at most 1/2; substituting fc -I- 1 = 
log^(l/cero) into Eq. (|168p . we conclude that the op- 
timal effective noise strength satisfies rjuu^ /(■^'''o) < 

it:-i(cero)^'°««^'/""'''-^ [Eq. (iMTl) ]. 

The condition Eq. p65p . used in the derivation of 
Eq. (|147p . can be justified for c = 0(1). Suppose for 
example that J is small compared to In that case. 
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gC^-i) grows slowly with k, and it is a good approxima- 
tion to assume e*^*^"^) ~ e. The optimal value of k is 
chosen such that i?*''+^(ceTo) < 1 and hence 



(170) 



Using Eq. (|160p wc see that Eq. p65p applies for fc < 
^max provided that 



^ (ci?)"^ + y (ci?)"^ + 9.43 (ci?)"^ 



which for i? = 4 is satisfied by 

c= 1.027. 



(171) 



(172) 



For consistency, we note that with these values Eq. (|170p 
yields e^'^'^^T^'^) < 0.244 < 0.54, as assumed in the 
derivation of Eq. ()160p . 

Wc can also check the self consistency of the approx- 
imation e''''"^) ~ e. Using this approximation together 
with Eq. ([T651) and Eq. (fTeTj) we find 

^(fc) ^ cf ->j(k-i) (^e(fc-i)rW)' 

R'^(^+3)/^(ceTo)\ce){jTo), (173) 



< 



/ 



and hence, using Eq. (|168p 



(fc) 



(174) 



Since jy^^ ^ 1 for 1 < /c < /smax, and Eqs. (|156p and 



(fTeO]) yield c. 



(fe) 



< 



-2 J 



we 



conclude that /•f^'^^ <C e for 



each k. Thus for J <C /3 we have e ~ /3 and e''^^ ~ /S*'^' ~ 
/? ~ e for each fc, where we have used Eq. ()164p . 

Numerical iteration of the recursion relations confirms 
that the approximation eC^"!) ~ e works well for J//3 < 
0.3, and that our estimate of Ty^Q*'' is reasonably tight in 
that case [6l|. For J > /?, though, e ~ J and e^''^ < e 
for I < k < fcnjax! we may still use Eq. p65p to derive 
an upper bound on 77^^*' in that case, but our estimate 
Eq. (|147p becomes overly pessimistic [6l|. Indeed, the 
case J ^ /3 is favorable for DD, since the bath dynamics 
is relatively slow and the system-bath coupling, which 
DD suppresses, is larger to begin with. For an analysis 
of concatenated DD in this case see Ref. [To| . 

If we concatenate a time-symmetric pulse sequence, 
which achieves second-order decoupling, then we may re- 
place Eq. (fT59)) by 



J(^) =2cf j(^-i) (eC^-i^rW)' 



(175) 



(the factor of 2 can be omitted if the system is a qubit), 
and we can also improve the estimate of C3 to 



4^)^^+9.43 (eC'-DrWy 



(176) 



where e^*^ i)^^^' < .54. Defining c for a time-symmetric 
sequence by 



2c. 



(fc) 



Eq. (|175p becomes 

= (cero)'i?2^j('="i\ 



(177) 



(178) 



which has the solution 

jW ^ (ceTo)"^i?'^''^'+i)j, (179) 

and thus 

V^^l=R''^'+^HceTof''iJTo). (180) 

The noise strength is optimized by choosing the largest 
integer k such that k + ^ < log^(l/ceTo). For exam- 
ple, if i? = 8 and cetq — 10~^, we choose fc^ax = 2 
(i.e., a sequence with duration T*^^) = 64ro) and obtain 

'7dd""'V('^'^o) = 1-7 X 10"^, an improvement by a fac- 
tor of 30 over the noise strength rj^^ achieved by the 
level- 1 sequence. The optimal noise strength satisfies 
V^Z'^/{Jro) < (ceTo)'°'^«('/''^°^-' [Eq. mi]- 



If we make the approximation 



(fc-i) 



e , then, be- 



cause R'^~^^{c€To) < 1 for the optimal value of k, we have 



(181) 



Using Eq. (fT76)) we see that Eq. ([T77| applies for k < 
fcmax provided that 



- + 9A3{c'^R)-^ 
9 



which for i? = 8 is satisfied by 

c= 1.332. 



(182) 



(183) 



As in our analysis for the non-time-symmetric case, the 
approximation eC^^^) ~ e is reasonable, and our estimate 
Eq. (|149p is fairly tight, if J is small compared to /3. The 
upper bound Eq. p49p applies more generally, but it is 
far from tight if J is much larger than (3, in which case 
eC") < e for 1 < A: < fc,„ax- 



B. Including pulse errors 

How is this analysis affected if the pulses are imper- 
fect? The answer depends on the degree to which the 
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pulse errors are systematic and reproducible, rather than 
random. As in our discussion of Eulerian decoupling, let 
us assume that the errors arc systematic. This assump- 
tion is reasonable if the pulse errors arise from the time- 
independent noise Hamiltonian that is "on" during the 
pulses, rather than from variations in the pulse shape. 

In the recursive analysis of the concatenated pulse se- 
quence, the effective Hamiltonian ijC^'"!' incorporates all 
the damage caused by the pulses errors at level k—1 and 
below. Because the pulse errors are systematic, we may 
use the same H^^~^^ to describe the noise in each inter- 
val between level-fc pulses. Suppose we imagine, at first, 
that while the pulses at level k — 1 and below are noisy, 
the pulses at level k are ideal, and denote by j'*^' the 
upper bound on ||iJorr || under this fictitious assumption. 
Repeating the derivation of Eq. (jl59p yields 



(184) 



But now we must relate j'^^^ to j'^^^ by estimating the 
effects of the pulse errors at the top level. 

The noise in these level- fc pulses is governed by the 
level-0 error Hamiltonian iJ^rr rather than the effective 
level-(fc— 1) error Hamiltonian Hi^r^^ ■ We could adapt 
our analysis of the Magnus expansion to this new situ- 
ation, using a different upper bound on iJon- during the 
pulses than in the interval between pulses, but then we 
would face the complication of revising our estimate of 
all the higher-order terms in the expansion. To avoid 
that complication, we use a different approach. As in 
Sec. I VII A 11 we assume that the Hamiltonian describing 
the sequence of noisy pulses at level k deviates in opera- 
tor norm from the Hamiltonian describing the sequence 
of ideal pulses at level k by at most 2J during a total 
time interval if there are R pulses each with width 
6. It then follows from Lemma |3] in Appendix [Cl that 



< 2R5J, 



(185) 



where Vt'^^^ includes pulse-error corrections at all levels 
while Vt'^^^ includes pulse-error corrections at level fc — 1 
and below but not at level k. From Eq. (|I9| in Appendix 
m we find that 



< 2S''^R5J, (186) 



where the "constant" is close to one ff and 
are both small; therefore we obtain an upper 
bound on j'^^^: 



< j(fc)+2 



/T 



(fc) 



(187) 



If at each level the second term in Eq. (|187p is small 
compared to the first term, then our previous analysis of 



the pulse sequence remains a good approximation, and 
we conclude that the pulse errors do not compromise the 
effectiveness of concatenated DD very much. However, 
the second term imposes a floor on (our upper bound on) 
the effective noise strength 



(188) 



A noteworthy property of Eq. (|187p is that only the 
pulse errors at the top level appear explicitly on the right- 
hand side. The errors at lower levels are included im- 
plicitly, through their contributions to J^''~^^ and e'^'^~^\ 
Accordingly, Eq. (|187p captures the idea that the cumu- 
lative effect of the errors in the R'' pulses is smaller than 
might have been naively expected, because errors that 
occur at lower levels in the pulse sequence become sup- 
pressed by the upper level pulses. This is an important 
feature of concatenated DD. 



IX. BEYOND THE LOCAL-BATH 
ASSUMPTION 

A key element of the noise model formulated in Sec. [Tl] 
is the local-bath assumption: at any given time, the noise 
Hamiltonians Ha and Hi, associated with distinct circuit 
locations a and b act not only on disjoint sets of qubits 
but also on disjoint baths. This assumption is impor- 
tant because it allows us to ignore interactions among 
different circuit locations and thus assign an effective 
noise strength t^dd to each DD-protected gate individ- 
ually. The local-bath assumption may be a reasonable 
approximation to noise in actual systems, at least in some 
cases, but it is not strictly satisfied; surely there are bath 
degrees of freedom that couple to multiple qubits, even 
while these qubits are participating in distinct gates. Can 
our analysis be extended to noise models that include cor- 
relations that arise because qubits participating in dif- 
ferent gates at the same time couple to common bath 
variables? 

Accuracy threshold theorems have been proved for 
Hamiltonian models of correlated noise in [3ll - l33j . Per- 
haps similar methods can be applied to DD-protected 
circuits, but this seems to be a technically challenging 
problem which we leave for the future. 

However, there is an easier problem that already arises 
when we consider just a single circuit location, and disre- 
gard how the noise at one location is correlated with the 
noise at another location. How is our analysis affected 
if the qubits at this location couple not just to a local 
bath comprising nearby bath degrees of freedom but to 
a global bath that includes bath variables that are far 
away? Of course, our previous analysis still applies if we 
replace the norm H-ffs.all of the local-bath Hamiltonian 
by the norm \\Hb\\ of the global-bath Hamiltonian in 
Eq. ([8|) and Eq. (fTO)) . but the trouble with this approach 
is that IjiJsll is a huge number that scales linearly with 
the volume of the bath, while an accuracy threshold cri- 
terion should be stated in terms of intensive quantities 
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that are independent of the size of the system and bath. 
On the other hand, we expect on physical grounds that 
the bath has a decomposition into local subsystems, and 
that the coupling of a given bath subsystem to a sys- 
tem qubit decays as the distance increases between the 
bath subsystem and the qubit; if in contrast each sys- 
tem qubit were coupled with constant strength to bath 
subsystems arbitrarily far away, the noise would be unac- 
ceptably strong and coherent manipulation of the system 
would be hopeless. Even though the local-bath assump- 
tion formulated in Sec|ll]may not hold exactly, a sensible 
noise model should be quasi-local — qubits ought to in- 
teract only very weakly with bath subsystems that are 
far away. In this case, can we express the effective noise 
strength in terms of intensive quantities? 

To be concrete, consider a noise model in which a single 
system qubit is immersed in a bath of Ni, non-interacting 
spins in an external magnetic field. The noise Hamilto- 
nian, assuming Hs = 0, is 

H = HB + HsB = J2^B,^ + J2HsB.^, (189) 

i i 

where 

HB,^ = Is ® B°, 

Here, the index i = 1, . . . ,Nt labels the bath spins and 
{(Ja, OL = 1,2,3} are the Pauli operators acting on the 
system qubit. We may define the strengths of the indi- 
vidual terms as 

\ = \\BsbA and 6, = = (191) 

and the strength of the system-bath coupling can be char- 
acterized by 

J = ^A, > IIHsbII; (192) 

i 

we assume that the sum converges to a (small) finite value 
in the limit Nt oo. On the other hand, the quantity 

^3 = Y,b^>\\HB\\ (193) 

i 

is not expected to remain bounded as A^;, — >■ oo. 

Now consider how the bath parameters {6,} enter the 
Magnus expansion for a DD memory sequence or for a 
DD-protcctcd gate applied to the system qubit. The 
Hamiltonian Hf,[{t) is 

HM{t) ^Hb + H'it) = Y,HB^r + H'{t), (194) 

i 

where H' {t) = or H^,,{t){^ HsB{t)) as in Sec. lV^ so 
that ||i7'(t)|| < J. Furthermore, bath operators acting 
on different bath spins commute: 

[Bl B^] = [Bl Bf] = [B^,Bf] = 0, Vz =^ j; (195) 



the only nonvanishing commutators of bath operators are 
[BO,Bf] and [-Bf,^/] (for any spin ^). 

The bath parameters {bi} do not contribute to fli{T), 
so consider f22(r). To estimate the integral in Eq. ([M)) 
(taking F = so that BM{t) = B{t)), we need an upper 
bound on the commutators. We observe that 

\\[Bb,BoAs2)]\\ = \\J2[BBa,BsBAs2)]\\ 

i 

< ^HbA ■ \\HsbAs2)\\ < 2W, (196) 

i 

where we have defined the single-spin bath parameter 

6= max||i7B,,|| . (197) 

i 

We also observe that 

||[i?err(si),i?crr(.S2)]|| < |1 ^ [i?SB,, (-Si ), i^SB.j (^2 )] |1 

id 

<^2A,Aj=2J^ (198) 

Together, Eq. ([TM| and Eq. ([TM]) imply 

mM{si),BM{s2)]\\ <4W + 2J2, (199) 
and plugging Eq. into Eq. ^ yields 

||n2(T)|| < (4W + 2J2) < (jT)[{b + J)T]. (200) 

Using the local-bath assumption we would conclude 
\\[Bm{si),Bm[s2)]\\ < 4/3J-h2j2; The result Eq mUh 
matches the conclusion we would reach under the local- 
bath assumption, but with (3 now replaced by b. 

Similarly, upper bounds on the higher-order Magnus 
terms can be also be expressed in terms of J and 6, 
though the "replace /? by & rule" does not quite work 
beyond second order. Consider, for example, one triple 
commutator that occurs in n^lT): 

\\[HB,[HeAs2),Herr{s3m\ 

= \\Y.[BBM,[HsBAs2),HsBA^3m 
ijk 

<J2{\\iHB.,[HsBAs2),HsBAs3m 
A\[Hbj,[HsbAs2),HsbA^3)]]\\) 

< 2(26) 2\,\j = 86J^ (201) 

In contrast, in the local-bath model we could upper 
bound the corresponding triple commutator by 4(3 J'^. 
Simply replacing /3 by 6 gives the wrong answer by a fac- 
tor of 2, because it fails to take into account that there 
arc two different bath spins that do not commute with 
[BsB, 1(32) T HsBjis A] for i 7^ j. Similar factors, depen- 
dent on n, occur in the higher-order nested commutators 
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contributing to J7„(T), but these factors do not depend 
on the total number of bath spins Ni,. 

We could also include quasi-local interactions among 
the bath spins, and still obtain an upper bound on each 
Magnus term expressed in terms of intensive quantities. 
Suppose for example that we include in the bath Hamil- 
tonian the additional term 

^E^s.fe)' (202) 

where 

Hba,j) = Is ® S(»,> (203) 

acts trivially on the system qubit but nontrivially on the 
pair of bath spins (ij). In that case there will be an 
additional term in our upper bound on \\[Hb , Hci-t{s2)]\\' 

hi 

<2^A,||i/B,(y>ll <2cJ, (204) 

where 

c = max . (205) 

Thus in the modified upper bound on ||ri2(T')|| we replace 
6 by 6 + c. The expression for c includes a sum over all 
bath spins, but converges to an intensive quantity if the 
interaction between bath spins i and j decays sufficiently 
rapidly with the distance between the spins. Similar con- 
vergent sums occur in the upper bounds on higher-order 
Magnus terms. 

Even when our bounds on the Magnus expansion are 
intensive, they might still be useless, if each local bath 
subsystem has a Hamiltonian with a large norm. In that 
case, though, there is another method that might suc- 
ceed, which relates the effective noise strength to the fre- 
quency spectrum of bath correlations. We turn to that 
method next. 



where 

Sa{t) - Uc{t)^SMt), B^t) ^ e**^«B„e-'*^«, 

(207) 

and we can study the interaction-picture time evolution 
operator using the Magnus expansion defined by this 
Hamiltonian. This expansion has the big advantage that 
the interaction picture sums up the effects of the free bath 
dynamics to all orders in /3; therefore, higher-order cor- 
rections are small provided J is small, even though (3 may 
be large. But there is also a substantial disadvantage: be- 
cause the interaction picture bath operator Ba (t) is now 
time dependent, a pulse sequence that achieves first-order 
decoupling in the toggling frame may not achieve first- 
order decoupling in the interaction picture. 

On the other hand, if the bath operator Ba{t) is in 
some sense slowly varying, then first-order decoupling 
might be satisfied to a good approximation. Though the 
rate of change of the operator Ba {t) is actually of order 
/3, if the state of the bath has suitable properties, then the 
expectation value of Ba (t) in that state may vary slowly; 
then DD may work well because the typical frequencies 
of the bath are sufficiently small, even though /? may be 
large. 

When estimating yyoo using the Magnus expansion, 
we did not make any assumption about the state of the 
bath. The new estimates we derive in this Section de- 
pend on the bath's frequency spectrum and hence im- 
plicitly on the bath's state. In order to obtain a simple 
formula for r/DD we will impose a further limitation on 
the noise model that was not needed in the Magnus ex- 
pansion analysis — we assume that the state of the bath 
is discarded at the end of each circuit location, and re- 
placed by a fresh bath state at the beginning of the next 
location. Thus we will include the effects of the bath's 
memory in analyzing the effectiveness of the DD pulse 
sequence at each circuit location, but we assume that 
noise correlations between consecutive circuit locations 
can be neglected. We recognize the artificiality of this 
noise model, but we adopt it anyway because it allows us 
to derive an explicit expression for yyoD- See Appendix [j] 
for further discussion. 



X. DYNAMICAL DECOUPLING AND BATH 
CORRELATIONS 



So far, we have described how to analyze the perfor- 
mance of DD using the toggling frame and the Magnus 
expansion. Another method is to use the interaction pic- 
ture defined by Hc{t) + Hb', that is, to transform away 
both the control sequence acting on the system and the 
free bath dynamics. In that case, the interaction-picture 
Hamiltonian is 



H{t) = [ul{t)<g>ul{t)\H,,,[Uc{t)<E>UB{t)] 



J2Sa{t)<E>Ba{t), 



(206) 



A. Dyson expansion 

In the toggling frame, it is convenient to analyze DD 
using the Magnus expansion because for a well chosen se- 
quence of ideal pulses f2i is a pure bath term, and the re- 
maining noise acting on the system resides in the higher 
order terms. But if we use the interaction picture in- 
stead, so that first-order decoupling is not exact even for 
ideal pulses, it is simpler to estimate the effective noise 
strength ryoD using the Dyson expansion rather than the 
Magnus expansion. The interaction-picture time cvolu- 
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tion operator U{t) = \ul{t) ® U]^{t) \ 11(1,0) i 



ij{t) = Tcxp -i / dt'H{t') 



(208) 



where T denotes time ordering. For the local-bath noise 
model, augmented by the assumption that the bath is 
refreshed at the beginning at each circuit location, the 
arguments in Appendix [j] show that the noise strength fj 
can be expressed as 

= msix(2l - U{T) -U\T)) ; (209) 

a,!*) \ / 

here T is the duration of the location, the expectation 
value (•) is evaluated in the pure state (g) |$a) where 
|$a) is (a purification of) the initial state of the bath at 
the beginning of location a, and the maximum is with 
respect to all circuit locations and all system states. As 
is also shown in Appendix [Jl 



77^ < max f dti ( dt2 {H{ti)H{t2) 
Jo Jo 



,JT 



l-JT-hjT) 



(210) 



For each term in the expansion Eq. (|206p the expectation 
value in the product state factorizes and we have 



f < 



a, 13 

1 



1-JT- ^{JTf 



(211) 



where the maximum over circuit locations and system 
states is implicit. 

Now suppose that the bath's time correlations are sta- 
tionary, i.e., that the expectation value {Ba{ti)Bp{t2))B 
is a function of the time difference ti — t2] this will be true 
if the initial state of the bath commutes with Hb , for ex- 
ample if the state is a mixture of energy eigenstates such 
as a thermal state. Then the bath correlation function 
may be expressed as 

/OO J 
^e-^^'^'--'-'^ (212) 
-OO 27r 

and Eq. ()21ip becomes 

/OO 
— y^{Sa{uj)Si3{-Uj))s Kap{uj) + 

where 



(213) 



dt e"'"* 



(214) 



and the ellipsis indicates the terms higher order in J. 
Defining the bath's spectral function J^^ ,■ by 

KM=2nY,Jl^J{uj-uj,), (215) 

i 

our expression for (the square of) the noise strength is 



< max ^ Jlfi.i{SaiiOt)Si3{-uJi))s + 



(216) 



Thus, speaking loosely, DD is effective if ^^(a;) is sup- 
pressed when a; is a "typical frequency" where the bath 
spectral function has support. We use the symbol 

advisedly, because \J Jap v ^""^^ ~ max |jiJerr|| , scales 
linearly with the strength of the system-bath coupling. 

The operator Sa{uj) can be written as TSa{ojT), where 
Sa is dimensionless. Adapting our terminology to this 
correlation function analysis, let us say that a pulse se- 
quence achieves nth-order decoupling if the first n terms 
in the Taylor expansion of Sa{(^T) vanish, so that 



Saiuj) = T {SaA^Tr + o[(c^r)"+i]) 



(217) 



Equivalently, the pulse sequence achieves nth-order de- 
coupling provided 



dtt"'Sait)=0 



(218) 



for all a and for m = 0, 1, 2, . . . , n — 1. Denoting the 
norm of the operator Sa,n by Ca.n, we find that for a 
pulse sequence achieving nth-order decoupling, the noise 
strength is 



< 



E 



2ti 



1/2 



(219) 



where now the ellipsis includes corrections both higher 
order in coT and higher order in the Dyson expansion. 
Therefore, ignoring the 0[{JT)^] corrections higher or- 
der in the Dyson expansion, nth-order decoupling im- 
plies that DD suppresses the effective noise strength by n 
powers of ujT where a; is a characteristic bath frequency, 
rather than n powers of eT as in our previous analysis 
using the Magnus expansion. 



B. Universal decoupling 

To be concrete, consider the case of a single qubit with 
noise Hamiltonian 



(220) 



For a sequence of ideal zero-width Pauli operator pulses, 
the time-dependent system operator in the interaction- 
picture Hamiltonian becomes 



<7a{t) = Fa{t)cra, 



(221) 
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where Fa{t) = ±1 (+1 if Uq commutes with Uc{t) and 
— 1 if cTq anticommutes with Uc(t)). For the universal 
decouphng sequence 

UcitBo) = ZIXIZIXI, (222) 

these functions are 

= (+ + —), 
F, = (+--+); (223) 



here, for example, -Fx = (+ H ) means that has 

the value +1 in the intervals [0,ro] and [ro,2ro] and has 
the value —1 in the intervals [2ro,3To] and [3ro,4ro]. All 
three functions integrate to zero over the interval [0, Atq] 
and hence achieve first-order decoupling. Evaluating the 
Fourier transform 

Faiuj)= dt e~"^*Fa{t), (224) 
Jo 

we find 



{-iLu)F^{Lj) = {x- 1)(1 +x-x^ ^x^) = -{x^ - if ^ Ae-^i^^° sin^iiUTa) = 4(wto)^ + . . . , 
{-ibj)Fy{ijj) = {x - X + x"^ - x^) = -{x -l){x'^ ~ l)/{x + 1) 

= 2e"2'"^" tan(wTo/2) sin(2a;ro) 2(wto)^ + . . . , 
{-iuj)F,{Lu) = {x-l){l-x-x^+x^) = {x-l){x'^-lf/{x + l) 

= 4ie"^*"^° tan(wro/2) sin^(a;ro) = 2i{ujTof + . . . , 

(225) 



where x 



The low-frequency suppression of Therefore we have 



Fy{uj) is stronger by a factor of 2 than the suppression 
of Fx{uj) because the period of Fy(t) is shorter than the 
period of Fx{t). The function Fz{uj) is suppressed by a 
further power of wtq because Fz{t) is time-symmetric: 
F^(4to — t) = Fz{t). Indeed, for any function F{t) satis- 
fying F{T - t) = F{t), we have 



F{u) 



dt 



F{t) = 



dt 



F{T - t) 



dt e-*"(^-*) F{t) = e-'"^F{-uj); (226) 



thus F{uj) = e-*'^^/2Fevon(w), where Feven(a;) IS an even 
function of a;, and F{uj) = 0(a;^) if F{0) vanishes. 
The time-symmetric pulse sequence 



UcitBo) = IXIZIXIIXIZIXI, 



(227) 



achieves second-order decoupling because all three func- 
tions obey F{t) = F{T - t): 



F, 


= (+ 


+ ++), 




Fy 


= (+ 


- + -- + -+), 




F, 


= (+ 


-- + + --+). 


(228) 



Compared to the four-pulse sequence, the functions F^ 
and Fy are repeated twice, but with a sign flip, so the 
Fourier transform is suppressed by an additional factor 
of 1 - x'' = 2ie-2'"^o sin(2a;ro) sa 4i(wto). The function 
Fz is repeated without the sign flip, so its Fourier trans- 



form is multiplied by 1 



2e 



-2iujrQ 



cos(2a;ro) « 2. 



F^{io) 
Fy{uj) 
Fz{uj) 



-16to(wto)^ 
~4:To{ujTof 4 



(229) 



Again, different types of low-frequency Pauli noise are 
suppressed by different (constant) factors, with the heav- 
iest suppression for phase {i.e., (Tz) noise. By altering the 
pulse sequence, the stronger suppression could be applied 
to or ay noise instead. 



C. Finite-width pulses 

If the pulses are not ideal, then first-order decoupling 
will not be exact. For example if the pulses have nonzero 
width, then there is a contribution to fj^ of the form 

(dtld<2)pwl] ('5a(il)^/j(i2))s {B^{t^)Bp{t2))g 

a, 13 

— ^ -/Q;g,i('S'Q,pw(wi)S';3,pw(— Wi))s; (230) 

here / ((ii)pw denotes integration over the nonzero- width 
pulses, and 



^a.pw(^) = y (di)pw e-'"' S^{t). (231) 

For a sequence of N pulses, each with duration 5, we 
expect S'q,.pw('^) ~ A^f^llSall ioxiot <^ 1. Comparing with 
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Eq. pi7p . we conclude that for a pulse sequence that 
achieves nth-order decoupling in the ideal case, pulse- 
width corrections are small provided 

N5/T < (wT)" (232) 

where w is a typical bath frequency. This is similar to the 
criterion we found using the Magnus expansion, except 
with the frequency uj now replacing the operator norm e. 

For an Eulerian sequence with reproducible pulse er- 
rors, Sa{uj) vanishes in the limit cj — > (by the same 
reasoning as in Sec. IVI Cp : therefore first-order decou- 
pling is exact. Furthermore Sa{u!) is an even function of 
w for any time-symmetric pulse sequence, and therefore a 
time-symmetric Eulerian sequence achieves second-order 
decoupling. 

D. Gaussian noise 

We have seen that, while in our previous analysis we 
required /3 = max||7JB|| to be small compared to I/tq 
in order to get a useful estimate of ??dd, the analysis 
based on bath correlation functions can provide a useful 
estimate even if /3 is large. However we still require that 
J = max ||i?ori || is small to justify neglecting the higher- 
order corrections in the Dyson expansion in Eq. (|210p . In 
some cases it is possible to go further and express these 

I 



To derive Eq. (f235|) . we use Eq. (|234l) and \\(T<y{t)\\ = 1, 
and we note that the value of \K{i,j)\ does not depend 
on the time ordering of ti and tj. We conclude that, 
in the case of Gaussian noise, the sum of all corrections 
higher than quadratic order in the Dyson expansion can 
be bounded above by 

oo 

^ A"Vn! = e'^ -1-K, (236) 

and that the quadratic term provides a good approxi- 
mation to the effective noise strength for K sufficiently 
small. 



higher-order corrections in terms of correlation functions 
as well, thereby obtaining an estimate that makes sense 
even if the system qubits are coupled to bath operators 
with large norm (e.g., the quadrature amplitudes of a 
bath of harmonic oscillators). 

Consider, for example, a single qubit coupled to bath 
operators whose correlators obey Gaussian statistics in 
the interaction picture: the interaction-picture Hamilto- 
nian is 

i/(i) =^a„(t)®S,(t), (233) 

a 

where the expectation value of an odd number of bath 
operators vanishes, and the expectation of an even num- 
ber of bath operators is 

{B{1)B{2)---B{2n)) 

= ^ K{ii,i2)K{i3,i4) ■ ■ ■ K{i2n-l,i2n)- 

contractions 

(234) 

Here the sum is over the (2n)!/2"n! ways to divide the 
labels 1, 2, . . . 2ri into n unordered pairs, and we use the 
shorthand B{i) = Ba^{ti), K{i,j) = {B{i)B{j))B- Thus 
terms of odd order in the Dyson expansion for fj^ vanish, 
and we may bound the (2n)-th order term as 



(235) 



E. Nonuniformly spaced pulses 

Another approach to analyzing DD is to use the Dyson 
expansion and to also expand Ba(t) in powers of Bot, 
thus obtaining a double expansion in powers of JT and 
j3T. In that case we might say that "nth-order decou- 
pling" is achieved if, in the expression for the interaction- 
picture evolution operator U{T), all terms of order T™ 
are pure-bath terms for m — 1,2,. ..,n. For the case of 
a qubit subject to pure dephasing noise {Bi = B2 = 0), 
it is shown in [l^, [s^ that in this sense nth-order decou- 
pling can be achieved by a sequence of X pulses with n-l-1 
pulse intervals, where the pulses are nonuniformly spaced 
in time. For general single-qubit noise, nth-order de- 
coupling can be achieved by a sequence of nonuniformly 



((2n)!io 



< 



^ dti...dt2nT {H{h)---H{t2, 

1 F 

— / dti...dt2n \K{tl,i2)---K{i 

2n — l -i ^2n ) \ 

^ ai,...Q2n contractions 



(2 



1 1 

— V {2Kr^—, where K^- dfds V |(i?„(i)i?M^)) 

contractions a,/3 
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spaced X and Z pulses with altogether [n + 1)^ pulse 
intervals [T^I, and for general m-qubit noise, {n + 1)^™ 
pulse intervals suffice |72l |. 

The corrections higher-order in T are not necessarily 
small unless both jiT <C 1 and JT <^ 1 are satisfied. 
However, the ideal pulse sequence constructed in [70j has 
the property 



dtt™Fa{t)—0, m = 0, 1, . . . n — 1, a — x,y,z. 

(237) 

(The sequence in [Ij, [S^l has this property only for 
a = z.) Therefore, even if /3T is not small, we can 
use the correlation function analysis to show that same 
sequence also achieves nth-order decoupling in the sense 
of Eq. (Uni). Therefore DD works effectively if wT < 1, 
where w is a typical bath frequency, provided that either 
JT ^ 1 or (in the case of Gaussian noise) K 1. The 
same remark applies to pure dephasing noise for the 
pulse sequence in [l^ [s^ . 



F. Concatenated dynamical decoupling 

Instead of using the Magnus expansion, we can analyze 
the performance of concatenated DD sequences using the 
Dyson expansion and bath correlation functions. As in 
Sec. IX Al we will suppose that the higher-order terms in 
the Dyson expansion can be neglected, and will focus on 
the lowest-order term Eq. (|216p . The objective is to show 
that, by concatenating k times a pulse sequence that 
achieves first-order decoupling, k-th order decoupling can 
be achieved, in the sense that Sa{oj) = 0[{ujT)'^]. 

To illustrate the idea, consider the simple pulse se- 
quence that decouples pure-dephasing noise for a single 
qubit: 



etc. Evaluating the Fourier transforms of these functions. 



UcituD) = XIXl, 



(238) 



so that the "level- 1" function multiplying in the in- 
teraction picture can be represented as 

fP = (+-). (239) 

When we concatenate the pulse sequence, i^i^'' is replaced 
by Fz'^\ in which i^i"'^' is repeated twice, but with a sign 
flip in the second repetition: 

J^i^) = (+ - -+), (240) 
and for higher-level sequences we have 

F^''> = {+-- + - + +-), 



+ 



we see that 



F^^\u)= dt e-''^'F^^\t), (242) 

"'0 



F^^\u) = {-^u:r\x-l){l-x) (243) 



and 



Fi^\u) = [l~x^'~')Fi'-'\ul (244) 
where x = e^*'^'^°, and hence 



Fi")(. 



fc=l 

2tj"^a;^/^sin(a;ro/2) 

■n 

X J^x^""" (2isin(2'=^2^To)) . (245) 



fe=i 



The leading behavior of this function for small wtq is 

Fi")(c^) = ro(i)"2"("-i)/2 (wro)" + • • • , (246) 



and therefore Eq. (|216p becomes 



1/2 



(241) 



(247) 

where we neglect corrections both higher order in the 
Dyson expansion and higher order in frequency. Naively, 
this expression for the effective noise strength t^dd is op- 
timized by choosing the level of concatenation n to be 
the largest integer such that 2"~^ (wtq) < 1 where lu 
is a "typical" bath frequency. Note, however, that for 
2" (oJTo) ~ 1 the higher-order corrections in (lotq) mod- 
ify ??DD by an 0(1) multiplicative factor. Note also that 
2"ro = T^") is the duration of the level-n pulse sequence, 
and thus the optimal pulse sequence has duration com- 
parable to a typical inverse frequency of the bath. 

Other concatenated pulse sequences can be studied 
similarly. Consider for example the universal DD se- 
quence. We have seen in Eq. (|225p that this sequence 
suppresses noise asymmetrically (the best suppression for 
az , the worst for ax), so we might choose to alter the se- 
quence at higher levels to provide more balanced noise 
suppression. But if we do not do that, the functions 
can be specified by augmenting Eq. (I225P with 
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(x^'")'^' (4j)cos(4'^-i(L.ro)/2) sin (4'^-i(c.ro)) i^i^'^H^), 



1-3; 



cos (4fe-i(wTo)/2) 



(248) 



r 



The weakest suppression of low-frequency noise occurs 
for Fx''\uj), where 

FW(a.) = (4'^ (za.ro) + • ■ ■) F(^-'\u;), (249) 

and hence 



IUTq 



k=l 



To 



( •)n4„(n+l)/2 (^^^^y _^ . . . ^ (250) 



where we neglect the terms higher order in ujtq. From 
Eq. (|216p we obtain the estimate of the noise strength 



1/2 



(") ^ An{n+l)/2 



(251) 



Noting that the universal DD sequence has length i? = 4, 
we see that Eq. ()25ip resembles Eq. ()168p . but with the 
operator norm e replaced by a bath frequency. 



XI. CONCLUSIONS 

We have derived upper bounds on the effective noise 
strength 77DD for DD-protected quantum gates, in terms 
of the parameters of a Hamiltonian noise model. From 
the upper bounds on the noise strength we can extract a 
noise suppression threshold condition, a sufficient condi- 
tion for DD-protected gates to outperform unprotected 
gates. We can also derive an accuracy threshold con- 
dition; when the noise parameters obey this condition, 
scalable quantum computing is possible. Our results 
show that DD, and in particular concatenated DD, can 
improve the gate accuracy and overhead cost of fault- 
tolerant quantum computing. 

Dynamical decoupling works when the noise varies 
slowly on a time scale determined by the pulse sequence. 
Therefore, estimates of the achievable effective noise 
strength depend on parameters quantifying the speed of 
the bath dynamics. We have used two different methods 



to quantify the accuracy of DD-protected gates, appro- 
priate for two different ways of characterizing the time 
variation of the noise. From the Magnus expansion in 
the toggling frame we derived an expression for j^dd in 
terms of the operator norm of the noise Hamiltonian; an 
advantage of this method is that 77DD does not depend 
on the state of the bath. From the Dyson expansion 
in the interaction picture we derived an expression for 
?;dd in terms of the frequency spectrum of bath correla- 
tions. While the bath frequency spectrum does depend 
on the state of the bath, the second method sometimes 
yields useful result when the first method fails, because 
the norm /3 — \\HB.a\\ of the local bath Hamiltonian is 
too large. Our correlation function analysis can remain 
applicable even in the formal limit /? — >■ 00. 

Our analysis of fault-tolerant circuits built from DD- 
protected gates applies only to Hamiltonian noise models 
satisfying suitable assumptions. For the Magnus expan- 
sion analysis we used the local-bath model; this allows us 
to study each DD-protected gate individually, ignoring 
noise correlations among distinct gates being executed 
in parallel at the same time. For the correlation func- 
tion analysis we used an even more artificial model, in 
which the state of the bath is refreshed after each DD- 
protected gate. This assumption allows us to include 
non-Markovian effects during the DD pulse sequence at 
each protected gate, but to ignore these effects when the 
DD-protected gates arc composed in a quantum circuit. 
It is clearly desirable to extend our analysis to models 
with more general noise correlations. 

Here we have proposed to combine DD with fault- 
tolerant quantum computing straightforwardly, by re- 
placing each gate in a fault-tolerant circuit by the cor- 
responding DD-protected gate. We have not studied 
systematically the improvements in fault tolerance that 
might be achieved using Eulerian dynamically corrected 
gates (53-l56| which are robust against pulse imperfec- 
tions. Nor have we considered the potential advantages 
of qubit encodings that allow gates and DD pulses to 
commute, so that both can be applied simultaneously. 
This latter strategy has been shown numerically to lead 
to robust gates for a spin bath model [65| . Perhaps other 
ways to combine DD with fault tolerance can be found. 
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leading to further gains in efficiency and accuracy. 



^^e^^*^) e , we first evaluate 
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Appendix A: Review of the Magnus expansion 

Here wc will briefly review some properties of the Mag- 
nus expansion that are used in our arguments. For a more 
detailed discussion, see jil ]. 

The foundation of the Magnus expansion is this theo- 
rem: 

Theorem 1. Suppose 



dt ^ ' 



(Al) 



Tht 



d °° R 

m^Y.^^'^lmiMit)]- (A2) 



dt ' — ' n 

n— 



Here the {B^} are the Bernoulli numbers defined by 



n=0 



(A3) 



d 

dX 
d_ f d_ 
di \dX 
d_ 

di 



dt 



dt 



n(t)e 



dt 



o(t) 



n=0 



(A5) 



In the last line we have used the identity 



^-^e-- = ;^^adS[A], 



(A6) 



n=0 



which can be verified by differentiating both sides k times 
with respect to A and then setting A = 0. Expressing 
M{t) as the integral of its derivative, we find 



M(t) 



dt 



/ dAV^adJ^ 
Jo „t^"! 

OO ^ 

n— ^ ^ 



dt 



n{t) 



dt 



n{t) 



(A7) 



Thus we have shown that M{t) = Oadnj^, [^^^(0]: 
where 



OO 



A 



(A8) 



which is inverted by 



A 



OO „ 

y^A" 



(A9) 



n=0 



Therefore, 



and adg [A] is defined by 

adl[A]^[B,[B, [■■■[B,[B,A]] 



(A4) 



('ad^[A] = A, and ad^[A] for n > 1 contains n nested 
commutators). The series converges provided \\rt{t)\\ < 

TT. 

Proof. To obtain a useful expression for M{t) = 



(AlO) 



from which Eq. (jA2p follows. 

Regarding the convergence of the expansion, we note 
that ||ad^|| < (2|jB||)", and that the series expansion of 
x/{e^ ~ 1) converges for < 27r, because the nearest 
poles to the origin in the complex x-plane are at a; = 
±27ri. Therefore the expansion in Eq. (|A2p converges for 
l|20(t)l| < 27r. 

□ 
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In the Magnus expansion, we express Q,{t) = 
12^=i^n{t), where r2„(i) is nth-order in M. Using this 
expansion, Eq. ()A2|) becomes 



Mit), 

J2-fSi^\t), n>2, (All) 



where 



(n-l) 



X! ado.jt)ada^^(t) • • •adn,^(t) [M{t)] ; 

,i2,...,ij 

(A12) 



here the sum is over nonnegative integers {ii,i2, • ■ • , ij} 
satisfying ii + i2 + ■ ■ ■ + ij = n — 1. We see that 



5W(t) = [r!„-i(i),M(0], 



(A13) 



and that SrP for j > 1 can be expressed as 

n-j 



2<j<n-l. (A14) 



The relations Eq. (|Alip . (|A13| . (|A14P provide an algo- 
rithm for generating the terms in the Magnus expansion 
recursively, and we use these recursion relations to derive 
our upper bounds on the higher-order terms. 



Appendix B: Even Magnus terms vanish for a 
time-symmetric Hamiltonian 

Here we prove the fact that, if HM{t) is time- 
symmetric, all even Magnus terms vanish. This was pre- 
viously known in the NMR literature, at least for the case 
of a piccewise constant Hamiltonian [JtI] . 

Lemma 2. If Hm{T - t) = HM{t), then n^iT) = for 
all even n. 

Proof. First we show that r2(T) is an odd function in 
A{t) = —iHM{t) when HM{t) (or correspondingly A{t)) 
is time-symmetric about T/2. Defining Ajy = T /2N for 
N a positive integer, the evolution operator from t = 
to t = T can be written as 

U(T 0) = lim e^(^)^"e^(^"^")'^" • • • e'^(T+'^")'^" 

^ -Ajv)A„ gA(Ajv)A„gA(0)A„ 

= lim eMO)^N^A{AM)AM , , , gA{^-A,^}Ati 



^ gA(2-A„)A„ . . . gA(A„)A„gA(0)Aj> 



(Bl) 



where in the second equality, we have used the time- 
symmetry A{T — t) = A{t). Taking the adjoint of 
Eq. jHll, and noting that A(t)t = -A{t), we find 



UUT,0)^ lim e-^W^"e-^(^")^' 



X e 



-A(f-A„)A, 



, g-A(A„)AiVg-A(0)A, 



(B2) 



Thus U^{T,0) has the same form as U{T,0), except for 
the replacement A{t) — > —A{t). 

Since U{T,Q) = exp (0(r)) and W{T,Q) = 
exp(— ri(T)). we conclude that under the replacement 
A{t) -> -A{t), n{T) transforms as fi(T) -^{T) + 
i2Tr£, for some integer i. In fact, since the integer £ cannot 
jump discontinuously when A(t) is smoothly deformed, i 
must be a constant independent of A{t), and by taking 
the limit A{t) — )■ we see that ^ = 0; thus fl{T) changes 
sign under A{t) —A{t), i.e., is an odd function of A{t). 

In general, f2„ (T) is an integral of an expression con- 
taining n factors of A{t). Thus, fln{T) is invariant under 
the replacement A{t) — >■ —A{t) for n even, and changes 
sign under this replacement for n odd. Since in the time- 
symmetric case r2(T) changes sign under A{t) — > —A{t), 
we conclude that f2„(T) vanishes for n even. □ 



Appendix C: Error estimate for time evolution 

Here we prove: 

Lemma 3. Suppose that the time evolution operator U{t) 
satisfies the differential equation 



-U{i) = -iH{t)U{t) 



(CI) 



with the initial condition U{tQ) — Uq, while U{t) satisfies 

(C2) 



jUit) = -iH{t)U{t) 



with the same initial condition, where both H{t) and H{t) 
are Hermitian. Then 



Proof. 



\\U{t)^U{t)\\ < / ds\\H{s)-H{s)\\ 

Jto 



\\U{t)-U{t)\\ = \\U{t)U{t)-'-l\\ 



(C3) 



ds— 

to ds 



{u{s)U{s)-' 



< I ds 

'to 



< I ds 
It 



-I I ds U{s) (h{s) - H{s)\ U{s)-^ 

ta 

U{s) [h{s) - H{s)) C/(s)-i 
H{s)~H{s 



(C4) 
□ 
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In this paper, we use Lemma |3] in three ways. In one 
application, we consider the case where both Hamihoni- 
ans are time independent, and conclude that (compare 
with Eq. dm) 



\\U{t)-U{t)\\<{t-to)\\H-H\\ 



(C5) 



This inequality allows us to relate the effective noise 
strength 7700 achieved by dynamical decoupling to our 
bounds on the terms in the Magnus expansion. 

In another application, we consider H to be Hs + Hb, 
where Hs governs the ideal system dynamics and Hb 
governs the bath dynamics, while the Hamiltonian for the 
noisy joint evolution of system and bath is H = H+Hsbi 
where HgB is responsible for the noise. Then in the local- 
bath model, if ||ffsB|| < J and a gate is executed in 
time To , Lemma [3] implies that the norm of the "bad" 
part of the gate is bounded above by Jtq . Thus we may 
estimate the effective noise strength in the absence of DD 
as 77 = Jtq, as in Eq. ([55]) . 

In the third application, we use Lemma |3] to estimate 
the error arising from pulses with nonzero width. We 
consider H{t) to be the Hamiltonian describing the ac- 
tual DD sequence with realistic pulses, and H{t) to be 
the idealized evolution for zero-width pulses, where both 
Hamiltonians are expressed in the toggling frame deter- 
mined by the ideal sequence. Suppose that there are 
R pulses, and that each realistic pulse has support in a 
time interval of width S. Both H{t) and H{t) can be 
expressed as a sum of a bath Hamiltonian and an error 
Hamiltonian; the bath Hamiltonian cancels in the dif- 
ference H{t) — H{t), and we suppose that for both the 
realistic and ideal sequences the norm of the error Hamil- 
tonian is bounded above by J during the pulses. Thus 
\\H{t)—H{t)\\ < 2 J during the pulses (a total duration of 
Rd), while H{t) = H{t) outside the pulses; thus Lemma 
[3] implies ||C/(T) - U{T)\\ < 2RSJ, as in Eq. mbh . 



Appendix D: Bounds for even Magnus terms in the 
time-symmetric case 

We want to generalize the argument used to compute 
the bound for il2{T) in the case where HM{t) is time- 
symmetric except for t e A. To do this for higher-order 
terms requires a formula for the Magnus terms for which 
all the multiple time-integrals arc explicit. Such a for- 
mula can be found in [tJ (for n > 2): 

r2„(T) = - / dti . . . [ dtnLn 
n Jo Jo 

X [[...[A{h),A{h)],...],Ait^)] (Dl) 

where 

n— 1 ^ n—l 

(D2) 



1=1 



i<jl<---<jn-l<n ra=l 



The Ln coefficients take care of the time-ordering and re- 
labeling of the integration variables. For n even, follow- 
ing what we did in the ^2 (T) case, we split up the n time- 
integrals into n different cases: (1) none of ti,i = 1, .. .n 
are in A, (2) exactly one of ti <E A, (3) exactly two of 
ti e A, . . ., (n) exactly n of G A. Case (1) is zero from 
the time symmetry of HM{t) for i ^ A; the remaining 
cases we bound by first bounding the nested commutator 
and Ln, and then doing the time- integral. 

The {n — l)-ncsted commutator can be bounded as 



< 2*^ 



[A{h),A{t2)],...],A{tn)]\\ 
'■\\[Aih),Ait2)]\\ \\Ait,)\\...\\Ait,M 



< 2"-^(4Je)e 
= 2" Je"- 



ri-2 



(D3) 



The 2"~^ factor in the first line comes from opening up 
(n — 2)-ncstcd commutators using submultiplicativity of 
the operator norm. The (4 Je) factor in the second line is 
an upper bound on ||[A(ii), A(t2)]|l- The coefficient i„ 
can be bounded by ignoring the step function (i.e. ignor- 
ing the time-ordering, since we do not have the details of 
A anyway): 



n-l 



;=1 l<Ji<j2<---<J„-l<n 
n-l / N n-l , 

El I n — l\ -sr-^ 1 /n—l 



I \ n — I J ^-^ n — I \ I 

1=1 ^ ^ 1=1 



(D4) 



The binomial factor arises from counting the number of 
terms in the sum over ji: we pick n — I elements from 
the numbers 1 to n — 1, and arranging them in ascending 
order gives a single choice of (ji, j2, ■ • ■ , jn-i) and hence a 
single term in the sum. The number of ways of choosing 
n — I elements from n—l distinct numbers is given by the 
binomial factor. To bound the remaining sum, consider 



1=0 

n-l 

= E 



71 - 1\ 1 



1=0 



I J n — I 
n-l\ 1 



x=l 



I J n — I 



(D5) 



Therefore, we have that 



\Ln\< / dx{l+xy 
JO 

= - (2"-i - 1) . 



77- 1\ 1 



n 



(D6) 



Putting these back in f2„(T) (77, even) and doing the 
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time-integrals, we find that 

< \ (2"-i - 1) (2"Je"-i) 



A2(T- A)' 



A" 



1) [T" - (T - A)"] . (D7) 



In the first inequality above, the terms in the brackets 
arc the n — 1 cases for choosing the times ti , . . . , t„ , with 
at least one being in A. 

One can check that Eq. (|D7|) agrees with Eq. (|14ip in 
the n = 4 case. We also see that, for each n, il„(T) is of 
order AT^~^ , and thus vanishes in the limit A — )■ 0. 



Appendix E: Sn ^ coefficients 



Here we derive bounds on the Sii coefficients found in 
the recursive formulas (Eqs. (|84ap - (|84ep ) for the Mag- 
nus terms. 



Lemma 4. For all n > 2, 1 < j < n — 1, 

\\Sl^Hm<fL'^J{2etr-\ (El) 
where the coefficients are defined recursively: 

/f^ = l, fi°^=0, n>l, (E2a) 

n-j m-1 I I 

/^■^=2EE^/i^Vr„V, n>2; (E2b) 

m=l p=0 

/lere the {Bp} are the Bernoulli numbers, defined by 



1 ^ p! 



(E3) 



p=0 



types of 

Sl^\t) - [a.-i(t), -»ifA/(t)] ; (E5a) 
Sir'Ht)-[^iit),st\'\t)]; (E5b) 

m=2 

for2<j<n-2. (E5c) 

Note that the last case occurs only for n > A. We will 
bound each case separately. First, for Sn\ 

\\si'Ht)\\<2\\n„^,it)\\\\HMm 

n-2 



p=l ^' "'0 



ji-2 



Eq. m becomes /^^^ - 2E;ri ^^{f^/l-i when j = 1; 
therefore \\Si^^\t)\\ < fi^^ J{2et)"^\ 
Next we bound S'n"^^'': 

||5("-i)(i)j|<2||17i(i)|l||5i"_f)(t)|l 

</^'lf'j(2et)"-i (E7) 

Eq. (IE2| becomes /^""^^ 2/^"^^' when j = n - 1; 

therefore < f^^^^ J{2et)"-\ 

Lastly, the 2 < j < n — 2 cases: 

<2\mt)\\\\si;i,'\t)\\ 

m=2 p=l P' ^-^0 ' 



<f'J-,''j{2ety 
+ Jh{2ety 



\^p\ r(p) Aj-l) 
z — ^ z — ^ n^m 



ni—2 p—1 



plm 



(E8) 



The expression within the brackets in the last line looks 
like fn"^ in Eq. (jE2p , except we need to add in the m ~ 1 
terms, as well as the p = terms. In fact. 



n-j m-1 I I 
2 \^p\ dp) fU-l) 

jL^ n^m Jn-m 



m— 2 p—1 



p'.m 



Proof. We will prove the lemma by induction. We begin 
with the smallest case where n = 2, j = 1: 

\\si'\t)\\^m,it),-^HMm 

< f ds II [HM{s),HMit)]\\. (E4) 
Jo 

The commutator can be bounded as ||[_ffM(s),^M(i)]|| < 
4Je. This thus gives ||4^^(t)|| < 4 Jet. Since f^^^ = 2, 
this can be rewritten as 115*^^^11 < AJet ^ f!^^\j{2et). 
For a given n > 3, suppose that the lemma holds for all 

for m < ri, 1 < p < TO — 1. There are three different 0. Putting this into ||S'^"'''(t)||, and using the fact that 



DM ^ •'n-l nlm ™ Jn-m 



0!1 

fO) _ Of 0-1) 



(E9) 



where in the last line, we have used the fact that 
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J < e, we get 

\\si=Hm<.f!r'^Ji2etr-' 

+ J(et)(2et)"-2 
This completes the induction. 



Jn J 71-1 



(ElO) 
□ 



the domain s G [— 27r, 27r]. It is also clear that y{0) 
0. Lemma [5] thus tells us that we can write G~^{y) 
where /„ is given in Eq. ({£2]) with 



s / s 
2 + - 1 - cot - 
2 V 2 



(F5) 



Appendix F: /„ coefficients 

In [5^, the {/«} were shown to be coefficients in the 
power series expansion of of G~^{y) = X^^i fnU^, the 
inverse function of 



y = Gis) 









/ dx 




(i-cot|); 


'0 







(Fl) 



Here, we will provide a self-contained proof of the above 
claim. It suffices to show that the coefficients of can 
be written in the form Eq. pisp . with fn^ defined via 
the recursion relations ()E2p . 

First, we prove a lemma that applies to a general func- 
tion y{s): 

Lemma 5. Suppose the smooth function y = G(s) is 
monotonic on its domain and satisfies y(0) = 0. Then 
G~^{y) can be written as fnV" where 



fn — . 
n! 



(F2) 



s=0 



and ^ = — 3-T . 

as g{s) 



Proof. Since y is monotonic on its domain, the inverse 
function G^^{y){= s) exists and has derivatives 



d" 
dy^ 



G~\y)^i9{s) 



ds 



5(5) 



ds 



9{s), (F3) 



where in the first equality, we have used the chain rule 
of differentiation: = ^^f ^ ( ^] -f = g{s)4-. 

ay ay as \as I as ^ \ / ds 

Since y is a smooth function on its domain, so is g{s) 
and hence all derivatives of G~^{y) exist. We can then 
expand G~^{y) as a Taylor series about y ~ and write 
G~^{y) = SJ^o some coefficients /„. We see 
that /o = since G-i(O) = 0. For n > 1, the Taylor 
coefficients are given by 



(F4) 



which, upon inserting Eq. (jF3p and noting that y(0) = 0, 
immediately gives Eq. (jF2p . □ 

For our purposes, the function y{s) is given in Eq. (|Fip . 
i.e. y{s) = G(s) which is smooth and monotonic over 



Lemma 6. The coefficients /„ in G ^{y) — X^i^i 

can be written in the form Eq. (|118p . with fn^ defined 

according to Eq. (|E2p . 



Proof. For n ~ 1, the index j in Eq. (jllSp can only take 
value 0, so /i can be written in the form Eq. (|118p if we 
set /I"'' = 1. To handle the case n > 2, we use Eq. (|E3p 
to expand cot(s/2) in terms of Bernoulli numbers, finding 

I cot (0 =Bo+ (^B, + i^s) + ^{^sy. (F6) 



Noting that Bq = 1, Bi = -1/2, B2J+1 = for j > 1, 
Bij < for j > 1 and > for j > 0, Eq. (|F6)) 

becomes 



^(^)=2 + i(l-cot0=E^«^- 



(F7) 



Using this series expansion of (?(s), we can rewrite (jF2 
for fn>2 as: 



1 \Bi\ 2"-i 



n2'' 



ds 



s=0 



(F8) 

We omit the j = term in Eq. (|F8p because the deriva- 
tive of a constant vanishes, and the sum over j termi- 
nates at J = n — 1 because higher-order terms vanish 
when we set s = 0. Thus /^>2 = Oi and by comparing 
with Eq. ([TTOI we define f^^^ as 



{n-iy. 



ds 



(F9) 



s=0 



Now we need to show that the {fn^} obey the recursive 
relation (jE2p . Using our definition of /^•''' from (jF9p . the 
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right-hand side of Eq. (jE2l) can be rewritten as 

n-j m-1 I „ 

p\m 

m— 1 



2 \^Ap)Aj~i) 

rjlr- "^^^^ ■'^^ 



m— 1 p— 
n— J 771—1 



2EE 



Bp I 2™-i 



= 1 p=o ^ ^ 



m— i p 



(n — TO — 1)! 



ds ^ 

71— ni— 1 



s=0 



2„_i "-i 
(n- 1)! ^ 



n — 1 

771 



-1 I R I 

P=0 ' 



^ ds 



(FIO) 



i5=0 



The expression ^X^pLo^ '^^F'^^) ^® J^^^ '"^'^ 
tend the upper hmit of the sum to infinity. We can indeed 
do this, because in the equation above, the expression 
is differentiated to — 1 times and s is set to 0. Hence, 
higher-order terms in the power series expansion of 17(5) 
with p > TO do not contribute. Therefore, 

n-] m-l I I 

Z / Z / 'p\jji 



?n — 1 p— 



-1 ^-J 



(n- 1)! 



9- 



^ ds 



rn — l 



s=0 



n—m— 1 



- 1 

(71 — 1)! \ TO 

^ ' m=l \ 



- ± - ~ 

T)! ^ 



^ ds 

n— 1— m 



(Fll) 



Now, for any differential operator 2? satisfying the prod- 
uct rule, i.e. 'Dixy) = 'D{x)y + x'D{y) (where x and y 
commute), I?" has the binomial expansion 



Take V = g^, x = s and y = ^ . Then (note that the 
TO = term is zero). 



ds 



Tl-l 

= E 

=0 rn=0 



n- 1 
m 

d 
^ ds 



ds 



s=0 



(F13) 

Putting this into (jFll[) gives exactly the expression for 



fi^'^ in dESl) 



□ 



Appendix G: Pure-bath term in second order of the 
Magnus expansion 

Here we consider the case where the toggling-frame 
Hamiltonian has a decomposition Hit) = Hb + Hc„{t) 
such that 

a 

where Sa{t) = Ul{t)SaUc{t) and the operators {So] are 
a Hermitian basis for traceless operators acting on the 
system such that 



iY{S^Sp)^Q 



(G2) 



for all a ^ p. We further assume that each pulse either 
commutes or anticommutes with each S'^, so that 

S^{t) - Ul{t)S^U,{t) = ±5„, (G3) 



and hence 



where Ca(*) = il- These assumptions are true, in par- 
ticular, for an n-qubit system if each Sa and each pulse 
is a traceless n-qubit Pauli operator. We will show that 
under these assumptions the second-order term in the 
Magnus expansion 

f^2(T) = / dh ( ' dt2 [H{h), H{t,2 )] (G5) 



contains no pure bath term; that is, tTs{^l2{T)) = 0. 

Because [Hb,Hb] = 0, it suffices to show that 
the system trace vanishes for [Hb, Hcii{ti)] and 
[i7crr(ii),ffcrr(<2)] for any ti,t2 G [0,r]. First we ob- 
serve that 

[HB,H,,,{t)] = ^CaW[So®I,B„®5J 

a 

= J2Ut)[Bo,Ba,]®S^ (G6) 

a 

has vanishing system trace. Next we note that if the 
product SaSp is traceless it can be expanded in the basis 
{^Q.}, so that 

SaSp — SapTa + ^ ga/B-fS^ (G7) 
7 

(where Ta might have a nonvanishing trace). Therefore 

Hcrr(il)ifcrr(i2) = E )'^" )B„B„ ® T„ -f • • • 

a 

Herr{t2)Herr{il) — ^ Co (^2)Ca (^1 j^a-^Q (g) + ' ' ' 
a 

(G8) 

where the ellipsis represents terms with vanishing system 
trace. Thus in the commutator [-fferr(^i)j ^oir(i2)] the 
terms proportional to T„ cancel, and what remains has 
vanishing system trace, as we wished to show. 
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Appendix H: Noise parameters for concatenated 
dynamical decoupling 



For the analysis of concatenated DD in Sec. IVIIH we 
considered dividing the third-order term in the Magnus 
expansion fia into a pure bath term and a remainder. For 
that purpose we may use: 

Lemma 7. Suppose an operator O has a decomposition 
= I®Bo + ^S'„®B„, (HI) 

a 

where both terms are Hermitian and tr (Sa) = for each 
a. Then 



\Bn\\ < \\0\ 



and 



Y^S^^BJ < 2\\0\' 



(H2) 



(H3) 



Proof. To derive Eq. (jH2| . suppose that lip) is a normal- 
ized pure state such that I (-01 Bo IV') I = ll-Boll J and consider 
the expectation value 



(H4) 



where |x) is also a normalized pure state (of the system). 
The right-hand side of Eq. ()H4|) is the expectation value 
in the state |x) of a traceless Hermitian operator. Unless 
this operator is zero, the expectation value can be either 
positive or negative depending on how \x) is chosen. By 
choosing |x) so that the expectation value {J2a ®Ba) 
in the state |x) is either zero or has the same sign 

as (I (g) Bq), we have 



|(0)|>|(I®i?o)|, 



(H5) 



and using ||I(g) BqH = ||Bo||, Eq. pE]) follows. From the 
triangle inequality, 

||^S'„®S„|| = IIO-I^Boll < 11011 + P«)Bo|| < 211011, 



which proves Eq. (|H3 



(H6) 
□ 



The inequality Eq. (jH3[) is tight if we do not restrict 
the dimension of the system, but if the system is a qubit 
(two dimensional), it can be improved to 



IE-. 



BJ\ < IIOIl 



(H7) 



For a qubit, there is an anti-unitary time-reversal opera- 
tor T : IV') — > CTylV')* such that T^daT = — (Tq. Suppose 



IV") is a normalized pure state such that K^'ISa''"" 
BaH)] = \\Y.a'^a<»Ba\\. By applying T (g) I if necessary, 
we can choose \tp) so that {J2a ® Bo) and (I Bq) 
have the same sign (unless (I® i?o) = 0). Therefore 



l(0)|> 

and Eq. (|H7)) follows. 



(H8) 



Appendix I: Relating distance between operators to 
distance between their exponentials 

Here we prove: 
Lemma 8. 

Ile^-e^ll > 2\\A-B\\ 



-2expQ||A + i?||)sinhQ||A-i3| 
Proof. Expanding the exponentials, we obtain 

oo ^ 

e'^-e^ ^A-B + y — \A"- B"] , 
and therefore 

oo 

||e^-e«||>||A-i?||-^-||A"-i?" 

Z ' 17,1 

Defining 

N = 
we have 

A" -B" = {N + Aiy -{N- Af)", 



n=2 



liA + B), M^^iA-B), 



(11) 



(12) 



(13) 



(14) 



(15) 



and when we apply the binomial expansion to (iV-|-M)"— 
{N — Af)" the terms even order in AI cancel. There are 
^(m) terms of order m in AI for m odd, each with an op- 
erator norm bounded above by || A/||"'|| A^||"^™; therefore 



|A"-B"|| < 2 ^ 



|A-f||'"||iV|i"^™ 



= (||iV|| + ||A/||)"-(||iV||-||A/|l)" 



(16) 



Thus wc find 



f;lp"-S"|| <exp(||7V|| 



lA/l, 



n=2 



-exp(||iV||-||A/||)-2||A/|| 

= exp(||iV||).2sinh(|jAf||)-2||Af|| 



and substituting into Eq. p3|) yields Eq. (|IT 



(17) 
□ 
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If the norm of the sum A + B is not too large, we can 
use Lemma [5] to show that A is close to B when is 
close to . For example, suppose that 



||A + i?j|<e+, p-i?||<e 
Then Lemma [5] implies 

\\A-B\\<c{e+,e 

where 



c(e+,e_) = 2-e' 



2 sinhe-/2 



(18) 
(19) 

(110) 



For example, if e+ = e_ = 0.3, we find c{e^, e_) = 1.20. 



Appendix J: Bath-state-dependent noise strength 
and Dyson expansion 

In the local-bath model, the noisy operation applied 
at the circuit location a is a unitary transformation Ga 
acting jointly on the system and bath. We may express 
Ga as the sum of a "good" part Ga = Ga ^ Ba (where 
Ga is the ideal gate), and a "bad" part Ba — Qa — Ga- 

The accuracy threshold theorem proved in [sil, [s^ es- 
tablishes that quantum computing is scalable provided 
the noise strength fj is smaller than a critical value 770. 
For this purpose, the noise strength may be defined in 
the following way. Recall that we model a noisy prepara- 
tion of a qubit as an ideal preparation followed by noisy 
Hamiltonian evolution for a prescribed period. There- 
fore, we may assume that the initial state of the system 
at the very beginning of a quantum computation is ideal, 
and that the initial state of the system and bath is a 
product state 



(Jl) 



It is convenient to introduce a reference system R that 
purifies the initial state of the bath; then the initial state 
of system, bath, and reference system is a pure state 



= l*°5> 



where 



tri 



(J2) 



(J3) 



Now consider a quantum circuit acting on the initial state 
I^S_Bfl)i ^n-d let Xr denote a set of r locations in the cir- 
cuit. Let U^'^'^{Xr) denote the transformation that results 
if we place the noisy gate Ga at each location a ^ Ir and 
place the bad part Ba at each location a £ Xr, U^^'^{Xr) 
acts trivially on R. We may say that the noise strength 
is f] if 



(J4) 



for any set Xr of r locations [32 



Since each Ga is unitary and therefore has operator 
norm 1, the submultiplicative property of the norm im- 
plies 

||C/'^-d(I.)|$")|| < \\U''^^iXr)\\ < n ll^ail. (J5) 

Therefore, we may choose the noise strength to be 

^ = max||Sa||, (J6) 



We used this definition for the analysis in Sec. IIVIIVIIH 
based on the Magnus expansion, of the effective noise 
strength achieved by dynamical decoupling. 

The threshold theorem can be formulated in a more 
general way [111, [s^ , so that the local-bath assumption is 
not really needed to define the noise strength or prove the 
theorem. We adopt the local-bath model in this paper so 
that we can study the efficacy of the DD pulse sequence 
for each circuit location individually; otherwise we would 
need to include noise correlations among distinct gates 
that are executed simultaneously, which would greatly 
complicate the analysis. 

The expression [3l|, [s^ for the noise strength does not 
depend on the initial state of the bath, but for the analy- 
sis of the effective noise strength in Sec.|XJ based on bath 
correlation functions and the Dyson expansion, we use a 
different definition of fj that does depend on the initial 
state of the bath. To state the new definition simply, it is 
convenient to put a further limitation on the noise model 
that was not needed in the Magnus expansion analysis — 
we assume that the state of the bath is discarded at the 
end of each circuit location, and replaced by a fresh bath 
state at the beginning of the next location. We admit 
that this new more restricted noise model is even more 
artificial than the local-bath model we analyzed previ- 
ously using the Magnus expansion. In a rather perverse 
compromise, we include the effects of the bath's memory 
in our analysis of the DD pulse sequence at each circuit 
location, but assume such effects are negligible when we 
stitch the DD-protectcd gates together in a quantum cir- 
cuit. 

Under this assumption, the noisy operation at location 
a is applied to a product state, where the initial state 
PB,a of the local bath for location a does not depend on 
the noisy operations applied at earlier circuit locations. 
Thus Eq. (jJ4[) is satisfied if we define 



ri = max 



|6a(|*)®|<f>a))||, 



(J7) 



where |$ a) is R purifiCRtioil of PB,ai 

and the maximum 

is over all circuit locations and over all pure states of 
the system. In terms of the interaction-picture operator 
applied at location a. 



Ua - GlGa = G\ {Ga + Ba) = la + GlBa, 



(J8) 



we may write ry as 



7? = max 



(C/a-Ia) (|*)®|$a))||, (J9) 
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or equivalently 



f - max((c7t-I.)([7_I„)), 

= max (21, - [7a - C/A , (JIO) 

a,!*) \ / 



where (•) denotes the expectation value in the state Y^) ® 
|$a)- This is the formula used in Eq. (|209|) in Sec.|Xl 

Now we can explain how the analysis would need to 
be modified if we relaxed the assumption that the bath 
is refreshed at the beginning of each circuit location. In 
the proof of the threshold theorem, we need to derive an 
upper bound not on the amplitude for a fault at a single 
circuit location, but instead on the amplitude for faults 
occurring at each of the r specified locations in the set 
Ir, as in Eq. (|J4p . Therefore, in our expression for ff in 
Eq. (jJ10[) . we should consider the state |<i>a) of the bath 
to be not the actual bath state at the beginning of loca- 
tion a, but rather the conditional state of the bath, given 
that faults have already occurred at a specified set of pre- 
vious locations. In |33| we obtained an upper bound on 
ff for the case of Gaussian noise by doing a global anal- 
ysis of the whole quantum circuit — generalizing that 
analysis to DD-improved gates seems difficult. On the 
other hand, we may still express ff' as in Eq. (jJlOp in 
the more general setting (without assuming the bath is 
refreshed), with the proviso that ff' is maximized over 
all such conditional bath states. This is not a very use- 
ful criterion as it stands, since this value of ff cannot 
be easily extracted from any feasible experiment. But it 
could become more useful were wc able to infer proper- 
ties of the bath correlations in the conditional state from 
weaker assumptions about the noise model. 

To derive Eq. (|210p . consider a location with dura- 
tion T . The interaction-picture time-evolution operator 
is given by Dyson's formula 



[/(T) = rexp 



dt H{t) 



(Jll) 



where T denotes time-ordering and H{t) is the 
interaction-picture Hamiltonian, which obeys = 
ll-ffoirll < J ■ Expanding the exponential, we find 



U{T)^l+Y^Un{T), 



(J12) 



n=l 



where 
and hence 



n{T) = j dti--- dtnT (^(il) • • • H{tn)) , 

(J13) 



\UniT)\\ < ^T-\\Hit,) ■ ■ ■ iJ(t„)ll < (J14) 



(JTY 



Similarly, W{T) has the expansion 

00 

c/t(r) = i + ^t/t(r), (J15) 



where 



Ul{T) = ^ r dt,--- dt^r (H{t,) - - - H{t^) 

(J16) 

here T' denotes reverse-time ordering, and again 

\\UliT)\\ < (J17) 
nl 

Noting that C7i(r) + ijl{T) = 0, we find 
(2l-U{T) - U^T)'^ 

00 

< - (U-2{T) + ul{T)) + ^ ||f/„(r) + ui{T)\\ 



n— 3 



< -(t>2(T) + L/|(T))+2^ 



(JT)" 



n— 3 



< - {U2{T) + Ul{T)J + 2 (^e'^^ - 1 - JT - -{JT)' 

(J18) 

To evaluate the expectation value of U2 (T) + f/^ (T) , we 
observe that 



T[H{ti)Hit2))+r [H{t,)H{t2)^ 
= H{ti)H{t2) + H{t2)H{ti), (J19) 



so that 



U2{T) + Ul{T) 



dtidt2 H{ti)H{t2) + H{t2)H{ti) 



dtidt2H[tl)H{t2). 



Finally we may express the noise strength as 



ff = max / dtidt2 {H{ti)H{t2) 



(J20) 



2 [ e'''^ - 1 - JT - -{JT)'^ ] , (J21) 



as in Eq. (piU)) . 
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